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Source Coding for a Simple Network 


By R. M. GRAY and A. D. WYNER 
(Manuscript received March 15, 1974) 


We consider the problem of source coding subject to a fidelity criterion 
for a simple network connecting a single source with two receivers via a 
common channel and two private channels. The region of attainable rates 
ts formulated as an information-theoretic minimization. Several upper 
and lower bounds are developed and shown to actually yield a portion of 
the desired region in certain cases. 


I. INTRODUCTION 
1.1 Informal statement of the problem 


To fix ideas, let us consider the following problem. Suppose that we 
are given a data source whose output is a sequence U1, U2, ---, that 
appears at the source output at the rate of 1 per second. The {U;}f 
is a sequence of independent copies of the discrete random variable 
U, with probability distribution Pr {U = u} = Q(u), uC vU a finite 
set. Our task is to transmit this data sequence over a communication 
channel having a capacity of C bits per second so that it is represented 
at the output as Ti, Ue, ---, CU. We assume that the data are trans- 


The work of R. M. Gray was supported in part by NSF Grants GK-5452 and 
GK-31630 and by the Joint Services Program at Stanford Electronics Laboratory 
and U.S. Navy Contract N0014-67-A-112-004. Parts of this work were performed 
while A. D. Wyner was visiting Stanford University ae the partial support of the 
ISL Stanford Affiliates. 
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mitted over the channel in blocks of length n, and allow processing at 
both the channel input and output (encoding and decoding). We define 
the ‘error rate” as 


1s : 
A= E— ¥ da(U:, Uz), (la) 
Nkhk=1 . 
where © 
0, u= %, 
duu, = | 58 (1b) 


is the Hamming metric. Thus, A is the average fraction of data digits 
delivered in error. 

The question we pose is: What is the smallest capacity C such that 
(for n sufficiently large) we can transmit the data through the channel 
and achieve an arbitrarily small A? The well-known answer to the 
question is that the minimum capacity C is the entropy H(U), de- 
fined by* 


H(U) = — XS Q(u) log Q(u). (2) 
vEew 


Now consider the case where the random variable U is a pair (X, Y) 
where x € Landy € Y. We have 


Q(u) = Q(z, y) = Pr {Xx = &, Y= y}; 
and 


HO) TA) 2 Qe, y) log Q(a, y). 


Setting U = (X, Y), A [as defined in (1)] is the fraction of pairs 
delivered in error. Thus, we conclude that H(X, Y) is the minimum 
channel capacity required to transmit the source output {(Xzxz, Yx)} 
with the error rate A arbitrarily small. 

Next, let us assume that, as above, U = (X, Y), but that it is only 


required to transmit the sequence {X;,} through a channel having a 
capacity Ci, and to deliver it at the channel output as {X;}. Let 


1 n 
Ax = F- De du(Xx, Xx) 
Nk=l1 
be the error rate for a system with block coding of block length n. 
The special assumption here is that the random sequence {Y;}f-=1 is 


available to the encoder and the decoder. See Fig. 1. 


* All logarithms in this paper are assumed to be taken to the base 2. 
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{xi} 


; ENCODER CHANNEL DECODER 


Fig. 1—Source coding with side information. 


fe} 


Again we ask: What is the minimum capacity Ci required to trans- 
mit {Xz} with Ax arbitrarily small (with n sufficiently large)? The 
answer!? is that the minimum C;, is the “conditional entropy,” 
H(X|Y), defined by 





ae Q(x, y) 
=~ » QrW)LX Qxiy(xly) log Qx\y(x|y)], (8a) 
where 
Qr_y) = Pr{Y=y} = Y Qz,y) (3b) 
rex 
and 
Qxir(aly) = Ze = pr {x = a|¥ = yh. (3¢) 





Qr(y) 


Note that H(X|Y) + H(Y) = A(X, Y). 

Let us remark that the above still holds if, instead of deliver- 
ing {Y.} to the decoder, we delivered a sequence {Y,}, where 
Ay = E(1/n)Xh-1du(Y:, Y,) can be made arbitrarily small. Thus, 
the capacity of the “‘side channel’? must be at least H(Y). 

Finally, we turn our attention to the problem to which this paper is 
devoted. Let the source output be {(X:, Yx)}e=1, as above. We assume 
here, however, that there are two receivers. Receiver 1 is interested in 
obtaining a reproduction {X,} of the sequence {X;}, and receiver 2 is 
interested in obtaining a reproduction {Y;} of the sequence {Y;}. 
Assume further that a network consisting of three channels is avail- 
able, as in Fig. 2. The first of these channels is a “‘common’’ channel 
(with capacity Cy) that connects the transmitter to both receivers, and 
the other two are “‘private’’ channels that connect the transmitter to 
each of the two receivers (with capacities Ci and C2). Assuming that 
we use block coding with block length n, the error rates are 


ee Ee S: du(Xr, 21) (4a) 
k=1 
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Fig. 2—Source coding for a network. 
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and 


Api se dueY PO, (4b) 
Nk=1 


We say that a “rate-triple’”’ (Ro, Ri, R2) is achievable if, for any triple 
of channel capacities (Co, C1, C2) for which C; > R; (¢ = 0,1, 2) and 
any e > 0, transmission over the network of Fig. 2 (with these 
capacities) is possible (with n sufficiently large) with Ax, Ay S «. 
Our problem is the determination of the set ® of achievable rate-triples. 

Before stating our results, we digress to give a formal and precise 
statement of the problem as well as some other specialized information. 
This digression can be omitted by the casual reader. 


1.2 Digression—formal statement of the problem 


Let { (Xi, Y:)}#=1 be a sequence of independent drawings of a pair 
of random variables (X, Y), X € X, YE Y. L and ¥ are finite sets 


and Pr{X =2,Y = y} =Q(a,y), rE xX, ye Yy. The marginal 
distributions are 


Qx(xz) = L Q(z, y) and = Qy(y)= LX Q(z, y). 
yEY : : zEN 


Often, when the random variables are clear from the context, we write 
Qx(x) as Q(x), etc. Define, for m = 1, 2, ---, the set 


Im = (0, 1,2, «++, — 1}. (5) 
An encoder with parameters (n, Mo, M1, M2) is a mapping 


fe: X" X Y* > Iu, X Iu, X Im, (6) 
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Given an encoder, a decoder is a pair of mappings 
fi: Iu, X Iu, X" (7a) 
{8° : Imp X Iny > Y". (7b) 


An encoder-decoder with parameters (n, Mo, M1, M2) is applied as 
follows. Let 
fz(X, Y) = (So, 81, 82), (8a) 
where 
X = (X1, ---, Xn) and Y = (¥1,---, Yn). 
Then let 


X = {$ (Go, S:), (8b) 
= fh (So, Si). (8c) 
The resulting error rate is 
= max (Ax, Ay), (9a) 
where 
1 n 
Ax = R= yy dy(Xx, Xi); (9b) 
Nk=l 
Ay => zi 3 dy(Yx, Y,), (9c) 
Nk=1 


du(-,-) is defined by (1b), and X;, Y;, are the kth coordinate of x 
and Y, respectively. The Hamming distance Dy(u, v) between the 
n-vectors u and v is the number of positions in which u and v differ. 
Thus, Ax = E(1/n)Dy(X, Y) and Ay = E(1/n)Du(Y, Y). 

The correspondence between the encoder-decoder pair (or ‘‘code’’) 
as defined here and the communication system of Fig. 2 should be 
clear. Note that the capacities of the channels in that diagram must 
be at least C; = (1/n) loge M; (¢ = 0, 1, 2). 

A triple (Ro, Ri, Re) is said to be achievable if, for arbitrary 
e > 0, there exists (for n sufficiently large) a code with parameters 
(n, Mo, M1 ,M_) with M; S 2"(#i+, 7 = 0, 1, 2, and error rate A S e«. 
We define ® as the set of achievable rates. Oui main problem is to 
ascertain the region &. 

It follows from the definition that ® is a closed subset of Euclidean 
three-space and the ® has the property that 


(Ro, Ri, Re) © R— (Ro + €0, Ri + «1, Re + €2) € AR, (10) 
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= 0, 7=0,1,2. The region & is Phererore completely defined by 
giving its lower boundary ®, where 


R & {Ro, Ri, Re) cE GR: (Ro, R,, R:) = R, (11) 
R; S Rit = 0,1, 2) > R; = Rit = 0, 1, 2)}. 
It follows immediately that ® too is closed. 

It can also be verified by a simple “time-sharing” argument that ® 
is convex (see appendix). This leads us to the following equivalent 
formulation of the problem. Let a; = 0,7 = 0, 1, 2 be arbitrary. Then 
define 

T1(ao, Q1, a) _ min (aoRo + oki + a2Rs). 


(Ro, F1,R2) CR 


Then it follows from the convexity of ® that the lower boundary ® 
is the upper envelope of the family of planes }°ga:R; = T'1(ao, a1, a2). 

We can think of 7'1(ao, a1, a2) as the minimum cost of transmitting, 
using a code with rate-triple (Ro, Ri, Re) over the network of Fig. 2, 
when the cost of transmitting a bit per second over the common channel 
is ao and the costs of transmitting a bit per second over the private 
channels to receivers 1 and 2 are a; and a, respectively. Now, since 
information sent over the common channel (in Fig. 2) can alternatively 
be sent over both private channels, it is never necessary to consider 
the case where the sum of the costs of a bit per second on the private 
channels a; + az < a, the cost of a bit per second on the common 
channel. Similarly, we need never consider the cases where a; > ao, or 
a2 > ao, since information transmitted over a private channel can 
alternatively be sent over the common channel. Since we can nor- 
malize ao as unity, the following theorem should be plausible. A com- 
plete proof is given in the appendix. 

For R = (Ro, Ri, Rz) satisfying R; 2 0, and @ = (a1, ae) arbitrary, 
let the ‘‘cost’’ be defined by 


C(a, R) = Ry + aiki + ack. , (12) 
With a held fixed, let 
T(a«) = min C(a, R). (13) 
RER 


The indicated minimum exists because ® is closed. For @ arbitrary, 
let S(w) be the set of R € @ that achieve T(a) = C(a, R). 
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Theorem 1: 
“RC U 8a), 
aEee@ 


(i) U Se) C@, 
wea’ 


where the boundary & is defined in (11), @ is the set of a = (a1, a2) that 
satisfy 
0S aa. £1, ay ta, 2 l, 


and Q’ ts @ with the elements (0, 1) and (1, 0) deleted. 


Remarks: 


(1) (0, 1) and (1, 0) are the only pairs in @ with zero elements. Thus, 
@ and @’ are nearly identical. 

(2) The theorem implies that ® is upper envelope in (Ro, Ri, R2)- 
space of the family of planes defined by 


Ro + aki + ake = T(e), 
aCe @. 


1.3 Upper and lower bounds on G 
1.3.1 Lower bounds 


We can immediately give some lower bounds to the region ®. We 
state them as 


Theorem 2: If (Ro, Ri, Re) € @, then 
(a) Ro + Ry + R, = H(X, Y), 


(6) Rot Ri 2 A(X), 


(c) Ro+ Rk: 2 H(Y). 


Proof: Suppose that (Ro, Ri, R2) C R. Then, for arbitrary « > 0, we 
can (for sufficiently large block length n) reproduce {X;,}, and {Y;} 
with arbitrarily small Ax, Ay, with capacity triple (in Fig. 2) 


(Co, C1, C2) = (Ro + €, Ri + €, Re + €). 


That is, with a code with M; = 2"°, 7 = 0, 1, 2. 
Since the total capacity of the three channels is Co + Ci + C2, we 
must have 


Cot Cit C2 = Ro + Rit Re + 8 2 A(X, Y). 
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Letting «e— 0, we have established (a). Inequality (6) follows in an 
identical way on observing that the common channel (with capacity 
Co) and the private channel to receiver 1 (with capacity C1) together 
transmit {X;}. Inequality (c) follows, similarly. 

Let us remark that inequality (a) is an expression of the fact that 
a communication system with the constraints imposed in Fig. 2 
cannot perform better than in the ‘“‘best of all possible worlds’’ situa- 
tion in which the receivers can collaborate. It is therefore called the 
“Pangloss bound.” The set of triples (Ro, Ri, Re) that satisfy }°2R; 
= H(X, Y) are called the ‘‘Pangloss plane.” Corresponding to rate- 
triples that lie on the intersection of ® and the Pangloss plane, the 
approximately H(X, Y) bits per second that characterize {X,, Yx} 
can be split up into three parts (corresponding to the information 
transmitted over the three channels in our network) such that {X;, Y:} 
can be essentially perfectly reconstructed by the three receivers in the 
network. In this situation, the information transmitted over the com-. 
mon channel represents a kind of “core” process. Furthermore, the 
smallest Ro, such that (Ro, Ri, Re) € R and lies on the Pangloss plane 
(for some Ri, Re), can be thought of as a measure of the ‘‘common 
information” of {X,} and {Y,}. This point is explored thoroughly in 
Ref. 3. 


1.3.2 Some easily achievable rate-triples 
We now assert that certain rate-triples are achievable. 
Theorem 3: The following triples belong to @: 
(A) Ro = A(X, Y), Ri = R2 = 0 
(B) Ryo = 0, Ri = A(X), R, = H(Y) 
(C) Ro = H(Y), Ri = H(X|Y), Re = 0 
(D) Ry = H(X), Ri = 0, R, = H(Y|X). 


Proof: To achieve (A), simply transmit {(X;, Y;)} over the common 
channel (and do not use the private channels). To achieve (B), trans- 
mit {X,} and {Y;,} over the private channels to receivers 1 and 2, 
respectively (and do not use the common channel). To achieve (C), 
transmit {Y;} over the common channel (requiring a capacity of 
about H(Y)), and deliver { Y;} to receiver 1 to use as side information 
for transmitting {X;,} over the private channel to receiver 1. This will 
require a capacity of about H(X|Y). We do not use the private channel 
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to receiver 2. Triple (D) can be achieved as in (C) by reversing to roles 
of X and Y. 

Let us remark that points (C) and (D) lie on the Pangloss plane 
(i.e., they satisfy relation (a) of Theorem 2 with equality), since 
H(X) + H(Y|X) = A(Y) + A(X|Y) = A(X, Y). Furthermore, be- 
cause of the convexity of @, all triples that are linear combinations of 
triples (A) to (D) are also members of &. The situation is summarized 
in Fig. 3. The plane labeled “(a)” in the figure is the Pangloss plane 
defined by Ry + Ri + R. = H(X, Y). Theorem 2(a) states that the 
region ® (and therefore its lower boundary ®) lies above this plane. 
Similarly, Theorem 2(b, c) states that ® and ® lie above the planes 
labeled ‘‘(b)”’ and ‘‘(c)” in Fig. 3. 

Now the points labeled ‘‘A,” ‘‘B,” “C,” and ‘‘D” in the figure are 
points (A) to (D) respectively in Theorem 3. As we mentioned pre- 
viously, points C and D (as well as A) lie on plane a. Thus (from the 
convexity of ®), the triangle ADC lies in ® and must therefore be part’ 
of the lower boundary @. Further, since points D and B lie on plane b, 





Fig. 3—Estimates of rate-region &. 
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the line DB is part of &. Similarly, line BC is part @. Finally, since 
points B, C, and D are achievable, so are the points on the triangle 
BCD. Thus, the only unknown part of the lower boundary &@ lies in the 
(upside-down) triangular pyramid with base BCD and apex at point 
E (the intersection of planes a, b, c). The coordinates of point FE are 
easily seen to be (Ro, Ri, Rz) = [1(X; Y), H(X|Y), H(Y|X)]. 

Let us remark here that there is one special source distribution 
Q(x, y) for which point E is achievable.t In this case, the entire bound- 
ary region ®& lies on planes a, b, c. This special case is when X, Y can 
be written X = (X’,V), ¥Y = (Y’, V), where X’ and Y’ are condi- 
tionally independent given V. Then J(X,Y) = H(V), H(X|Y) 
= H(X'|V), H(Y|X) = H(Y'|V), so that point EF is Ry = H(V), 
R, = H(X'|V), R2 = H(Y’|V). Clearly, if, in the system of Fig. 2, 
we transmit V over the common channel and X’ and Y’ over the two 
private channels, we can reconstruct X = (X’,V) at receiver 1 
and Y = (Y’,V) at receiver 2. This requires a capacity triple 
Co= AV) +6 Cr=A(X'|V) +e Co=HA(Y’|V) +e (e>0 
arbitrary), so that point is in fact achievable. 

We now give a characterization of the region ® (and therefore of ®) 
in terms of information theoretic quantities. This characterization is, 
in fact, the main result. 


1.4 Characterization of ®—the main result 


Suppose we are given Q(z, y), x€ X, yC Y, an arbitrary prob- 
ability function, where X, Y are finite. Let © be the family of prob- 
ability functions p(x, y, w), where x € LX, yEC Y, w CW, and W is 
another finite set, for which 


D p(x, Y; w) = Q(z, y), LrEXYE Y. (14) 
wEew 


Each p € @ defines discrete random variables X, Y, W in an obvious 
way. For each p € @, define the subset of Euclidean three-space 


AR) = {(Ro, Ri, Re): Ro = I(X,Y;W), Ri 2 H(X|W), 
R, = H(Y|W)}, (15a) 
and then let 
R= (U aye, (15b) 
PEC 


tM. Kaplan has shown that, in fact, this special case is the only one for which 
point # is achievable. 
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where ( )¢ denotes set closure. Then our main result (the proof of 
which is given in Section IIT) is 


Theorem 4:8 = @*. 
Remarks: 


(1) Let us define @7 as the family of “test channel” transition proba- 
bilities. That is, ®r is the family of all p:(w|z, y) (et E X,y EC Y,w Ew), 
where W is a finite set, and for each (2, y), p:(w|z, y) is a probability 
function on ‘W. Corresponding to each p; € ®r, we have p(z, y, w) 
= Q(z, y)p:(w|x, y) € @. Further, for each p € @, we have p:(w|z, y) 
= [p(z, y, w)/Q(x, y) | € @r. Thus @ is in 1-1 correspondence with @7. 

(2) Since ® is convex, Theorem 4 implies that ®* is convex also. 

(3) Theorem 4 can be invoked to show that T'(e) defined in (13) is 
also given by 


T (a) = a [1(X,Y¥;W) + a:H(X|W) + a:.H(Y|W)]. (16) 


Thus, from Theorem 1, the lower boundary @, and therefore @, is 
essentially determined by 7a) given by (16). 

(4) Theorems 2 and 3 can be verified easily by using Theorem 4. 
Thus, if (Ro, R1, Re) € R, from Theorem 4 for arbitrary e > 0 we can 
find a triple of random variables X, Y, W such that | 


Ro+ Rit R2 = 1(X,Y;W) + H(X|W) + H(Y|W) — « 


= H(X, Y) + (H(X|W) + A(Y|W) — W(X, Y|W)] — « 
= A(X, Y) -«—- H(X, Y), ase—0Q. (17) 


This is Theorem 2(a). The second inequality in (17) follows from the 
fact that the entropy of a pair of random variables is less than the sum 
of the respective entropies. Part (b) of Theorem 2 follows from 


1(X,Y¥;W) + H(X|W) =1(X;W) +1(¥; WX) 
+ H(X|W) = 1(X;W) + H(X|W) = A(X). (18) 


The first equality in (18) follows from a standard identity [Ref. 4, 
Eq. (2.2.29) ]. 

Theorem 3 follows from Theorem 4 on taking W as follows: (A) 
W = (X, Y), (B) W =0, (C) W = Y, (D) W = X. 

(5) Although Theorem 4 characterizes ® and ® by an information 
theoretic minimization, it must be emphasized that the minimization 
is not, in general, easy. In fact, there is no nontrivial case for which 
we have succeeded in calculating the entire boundary ® analytically. 
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Its major utility at this point has been in finding upper bounds on & 
by guessing at a p or p; and calculating the corresponding triple 
[I(X, Y;W), H(X|W), H(Y|W)], which must lie above ®. See the 
example below. The problem of computation of ® both analytically 
and numerically is still open.t 

(6) For p € @, we can define the quantities 


Boy(w) = Pr {X =2,Y = y|W = v}, reExyEeEYy,wew, 


which can be thought of as the transition probabilities of the “backward 
test channel.”’ For a given (x, y), we can think of 62, = Bz,y(W) asa 
random variable. Of course, 62, must satisfy 


Bry 2 0, (19a) 
DD Bz = 1, (19b) 

and 
EBry _ Q(z, Y); (19c) 


where the expectation is taken over the distribution for W. Further, 
1(X, ¥;W) = H(X, Y) — H(X, Y|W) = H(X,Y) 
| — EZ Balog, (20a) 
x,y Bsy 


H(X|W) = ED a log sas H(Y|W) = BY 6? log 55 (20b) 
where 
Bi? = 2X Bey =Pr{X =z|W}, and 6? = 2X Bey 
=Pr{¥ =y|W}, (20c) 


and the expectation is taken over the distribution for W. Using this 
idea, it is possible to characterize, for example, T'(«) as follows (see 
Ref. 3, for a precise proof of this characterization). Given Q(a, y), 
xe x,y EC Y, define @ as the family of collections of random vari- 
ables, {B2,}, ¢ © X, y € Y, which satisfy (19). Then 


T(«) = min [1(X, ¥;W) + aH (X|W) + oH (Y |W), 


T One reason for the difficulty is that I(X, Y; W) + aH(X|W) + mH (Y|W) is 
apparently neither convex nor concave in pr. 
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where [(X, Y;W), H(X|W), H(Y|W) are given by (20) and the 
minimum (which can be shown to exist) is over all sets {8,,} in B. 

This characterization may have value in the computation problem, 
since the quantities in (20) are linear functions of the joint distribution 
function for the {8.,} and the constraints of (19) are also linear in- 
equalities in this distribution function. Thus, calculation of T(q@) is a 
linear programming problem. 

(7) If p € @ is such that X and Y are conditionally independent 
given W, then H(X,Y|W) = H(X|W)+ H(Y|W). Thus, with 
Ry = (X,Y; W), Ri = H(X|W), R. = H(Y |W), 


Ro+ Ri + Rz = A(X, Y) — A(X, Y|W) + H(X|W) + A(Y|W) 
= H(X, Y), 


and (Ro, Ri, R2) € ® and lies on the Pangloss plane. Reference 3 
shows that this class of triples (corresponding to a p € @, with X, Y 
conditionally independent given W) completely characterizes the inter- 
section of ® and the Pangloss plane. 


1.5 An example 


As an example of the preceding, let us consider the special case where 
the source is the ‘‘doubly symmetric binary source’? (DSBS), where 
x= y = {0,1}, and 


Q(z, y) = 3(1 — po)bzy + zpo(l — 62,y), 2,y = 0,1, (21) 


and the parameter po satisfies 0 S po S 4. We can think of X as being 
an unbiased binary input into a binary symmetric channel (BSC) with 
crossover probability po, and Y as being the corresponding output, 
or vice versa. To get a clearer picture of the set of achievable rates &, 
let us restrict ourselves to the plane in (Ro, Ri, Re)-space, where 
Ri = Fz. The intersection of ® and this plane can be plotted in a two- 
dimensional picture. 

Let us first take a look at the implications of Theorems 2 and 3. In 
this source, 


A(X) =H(Y)=1, H(X|Y) = H(Y|X) = h(po) 


and 
H(X, Y) = H(X)+ H(Y|X) =1+h(p), 


where 
h(dA) = —dAlog A — (1 — A) log (1 — A), OSrAS81 (22) 


is the entropy function. [We take h(0) = A(1) = 1.] With Ri = R,, 
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Theorem 2 yields 
Ro+2hi 21+ A(po), (23a) 
Rot hi 21. (23b) 
Thus, ® and therefore the lower boundary ® must lie above the lines 
labeled a and b in Fig. 4. 
Now Theorem 3 implies that points A[Ro = 1+ h(p.), Ri = 0], 
and B(Ry = 0, Ri = 1) are achievable, so that any point on the line 
connecting them is also achievable. But we can do better. Let us drop for 


a moment the requirement that R; = Rz. From Theorem 8, C and D, the 
points [Ro = 1, Ri = h(po); R, = 0] and [Ro = 1, Ry = 0, R, = h(po) | 


1 ~ h(Po) = H(X,Y) 


Ro 





Ry, =R2 


Fig. 4—Estimates of rate-region ® for the DSBS. 
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are achievable. Thus, the point in (Ro, Ri, Rz)-space halfway between 
them is also achievable. But this point, 


[Ro = 1, Ri = Zh(po), kz = 3h(po) 1; 


satisfies Ri = Re, and is therefore of interest to us now. Point F in 
Fig. 4 is therefore achievable, and therefore so are line segments AF 
and FB. But line segment AF coincides with line a, so that it must be 
on the boundary @. So far, the unknown part of the boundary curve 
@ lies in triangle FHB. We can do better, however, by using Theorem 4. 

Theorem 4 asserts that any triple in ®‘), p € @, is achievable. We 
therefore guess at a p € @ that defines random variables X, Y, W, 
and then assert that the triple Ro = I(X, Y;W), Ri = H(X|W), 
R, = H(Y|W) is achievable. Since we choose a p € @ such that 
Ri = Rez, this triple is of interest in our present discussion. The p € @ 
we have chosen is (with W = {0,1}) given by Table J. The quantity 
pi = $(1 — V1 — 2 po). One way of characterizing p is to think of W 
as an unbiased binary input and X, Y the respective outputs of two 
independent BSC’s, each with crossover probability p:. Note that 
these two BSC’s in cascade are equivalent to a single BSC with cross- 
over probability, 2p1(1 — pi) = po. 

With X, Y, W so defined, X, Y are conditionally independent given 
W, so that (Ro, Ri, Re) lies on the Pangloss plane. [See remark (6) 
following Theorem 4. ] We have 


Ro = 1(X, ¥; W) = H(X, Y) — H(X, Y|W) 
= 1+ h(po) — 2h(p1), 
Ri = R. = H(X|W) = h(pi). (24) 


This is point G in Fig. 4. Line segment AG is therefore on the boundary 
&. From these simple arguments, we see that the unknown part of 
the boundary @ lies in the triangle GHB. 

To obtain a still tighter bound on ®, we employ the same technique 
as above—i.e., “guessing” at a p C ® and then deducing that a”? 


Table | 
AY 
we 00 01 10 i 
0 3(1 — p:)? 3pi(1 — pr) 3p1(1 — pr) 3D} 
1 3D; 4pi(1 — pi) 3pi(1 — pu) 3(1 — pu)? 
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Table Il 


XY 
ON 00 01 10 11 
1 1 
: (ao) Poe Pos Aes 
1 1 
; 3 (8-5) Pare Pus 3 (1-8 -F 


C @. Let B be a parameter for which 
pi = 3(1 — V1 — 2p) SBS 5. (25) 


Then let W = {0,1}, and p(a, y, w) be given by Table II. Then the 
triples (Ro, Ri, Re )C€ ®, where 


Ryo = 1(X,Y;W) = H(X, Y) — A(X, Y|W) 
= 1+ hoo +5 (1 — 6-8) log 5 (1 -s- 2) 


Pog P24 1(g _ Po\igg fg — Bo 


and | 
Ri = R. = H(X|W) = H(Y|W) = hf). (26b) 


For 6 = pi, the triple of (26) coincides with that of (25), i.e., point G 
in Fig. 4. For 6 = 3, the triple of (26) is Ro = 0, Ri = Re = 1, ie, 
point B of Fig. 4. As 6 increases from p, to 3, the family of rate-triples 
of (26) generate a curve c, which lies below the line GB and therefore 
constitutes a tighter upper bound on &. We conclude that the unknown 
portion of @ lies in the shaded region in Fig. 4. 

In Section 2.5 we give some insight into how we ‘‘guessed”’ at these 
distributions p € @. 


Il. GENERALIZATION TO A FIDELITY CRITERION 


In this section we formulate a generalization of the problem of Sec- 
tion I in which we require that the source sequences {X;,} and {Y;} 
be reproduced to within a specified fidelity criterion and not, as in 
Section I, essentially perfectly. The proofs of the main theorems ap- 
pear in Section III. 


2.1 Definitions and formulation of the problem 


Let {(X;, Yx)}@=1 be a sequence of independent drawings of a pair 
of random variables X € X, Y € Y, where the “source alphabets” 
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x and ¥ are either discrete sets, the reals, or arbitrary measurable 
spaces. We assume that we are given a probability law that defines 
(X, Y). If X and ¥ are discrete, then we write 


Q(z,y) = Pr{X =2,Y = y}, LEX, ye Y. 


If x, Y are the reals, then (X, Y) may be defined by a probability 
density Q(z, y), —-~ <2,y < ©. For arbitrary measurable X, 4, 
the pair (X, Y) is defined by a probability measure Q on X X Y. The 
marginal distribution for X, Y will be defined similarly by Qx, Qy 
respectively. . 
As in (5), define the set J, = {0,1,---,m — 1} form = 1,2,->>. 
An encoder with parameters (n, Mo, Mi, M:) is [as in (6) ] a mapping 


fe: X" X y* > Iu, X Im, X Im, (27) 


We assume that the sequences {X;} and {Y;} are to be reproduced as 
sequences of elements of sets £ and J, respectively, called “repro- 
ducing alphabets.”’ Thus [as in (7) ], corresponding to a given encoder, 
a decoder is a pair of mappings 


{S°: Iu, X Iu, > &*, (28a) 
a Iu, X Im,> Yr. (28b) 


Let us adopt the convention of denoting n-vectors with bold-face 
type (either upper or lower case) and the components as the same sub- 
scripted letter in ordinary type. For example, u = (us, +++, Un). 

An encoder-decoder with parameters (n, Mo, M1, M2) is applied as 
follows. Say 


fz (X, Y) = (So, Si, Se), (29a) 
where X © &*, Y € Y", and (So, Si, S2) is a triplet of indices. Then set 
X = f5(So, 8:2), Y= f$ (So, S2), (29b) 


where X € &", Y € Y. The encoder-decoder is said to have average 
distortion (Ax, Ay), where 


Ax = ED,(X,X%), Ay = ED.(Y,Y), (30a) 
and the single-letter distortion functions are defined by 
A 1 vt A 
Di(x,%) = = Lo di(ze, £x), (30b) 
Nk=1 
P L 2 . 
Dilys ¥) = | Le dalyes Gu), (30c) 
k=1 
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xE Xx FEC LyC Y*,F7 € Y", and di(-, -) isa given nonnegative 
per-letter distortion function for the X-receiver and de(-, -) is a given 
nonnegative per-letter distortion function for the Y-recelver. An 
encoder-decoder with parameters (n, Mo, Mi, M2) with average dis- 
tortion (Ax, Ay) is said to be a code (n, Mo, Mi, Mz, Ax, Ay). 

A rate-triple (Ro, Ri, Re) is said to be (Ai, Az)-achievable if, for 
arbitrary « >0O and vn sufficiently large, there exists a code 
(n, M,, M;, M2, Ax; Ay) with 


M,; S QrRito, 4= 0,1, 2, 
and 
Ax SAit+ 6 Ay S$ A:+ «. 


The set of all (Ai, A2)-achievable rate-triples is called @(Ai, Az). Our 
main problem is to ascertain ®(A1, Az), A1, Az 2 0. Clearly, this gen- 
eralized problem reduces to the problem of Section I, if x = &, 
y = Y, d, = d: = dy, and A; = Az = 0. As in Section I, the region 
®(Ai, Ac) is completely defined by the boundary (Ai, Az), where 
® = @(Aj, Az) is defined in (11). Further, we show in the appendix 
that ®(A1, Az) is convex and that Theorem 1 holds with R = ®(Aj, A2). 


2.2 Rate-distortion functions and conditional rate-distortion functions 


A major tool in this study is rate-distortion theory. Specifically, 
joint, marginal, and conditional rate-distortion functions (or simply 
“rates’’) are used both in evaluations and bounds. These functions 
and their properties are dealt with in Refs. 1, 4, and 5. Here we only 
summarize some pertinent definitions and properties. 

The marginal, joint, and conditional rates are defined as follows. 
Consider first the case where the alphabets x, £, Y, ‘J, are finite and 
Q(x, y), Ox(x), Qy(y) are probability functions. Then the (joint) rate- 
distortion function is defined by 


Rxy(Ai, A2) = min I(XY; XY), (31) 


where the random variables XY are defined by a “‘test-channel” 
qe(2; 9|x, y)—i.e., a probability function on & X Y for every (2, y) 
€ LX X Y. The information in (81) is calculated for the joint distribu- 
tion 

Pr {Xx =2,Y= y, X = &, P= g} = Q(z, y)qr (4, 9 |z, y). (32) 


The minimum in (81) is taken with respect to all test channels gq; such 
that Edi(X,X) S As, Ed.(Y, Y) S Ao, where the expectations are 
taken with respect to the distribution (32). The minimum always 
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exists. Similarly, the marginal rates are defined by 


Rx(Aj) _ min I(X; X), (33a) 


a; (2/2) :Edy(z,2) SAy 


Ry(A2) = min I(Y; ¥), (33b) 


a:(ily):Ed(¥,¥) SA2 


where the expressions in (33) are interpreted analogously to that of 
(31). Detailed discussions of these quantities and their significance 
can be found in Refs. 1 and 4. 

Another quantity that plays a crucial role in our study is the ‘“con- 
ditional rate distortion function.” Let X, Y be finite, and let Q(z, y) 
be given. Let p(x, y, w) be a probability function on X& X Y XW, 
where W is a finite set such that >) up(z,y, w) = Q(x,y). Then 
p(x, y, w) defines a triple of random variables X, Y, W, where the 
marginal distribution for X, Y is Q. The conditional rate-distortion 
functions are defined as 


Rxyjw(A1, As) = minI(X, ¥; XY|W), (34) 


where the minimum (which always exists) is taken with respect to all 
test channels q:(2,9|2, y, w) such that Ed,(X,X) S Ai, Ed.(Y, Y) 
< A». The conditional information in (34) is defined in Ref. 4, p. 21. 
The conditional rates Rx;w(A1), Ry;w(Az) are defined analogously. A 
detailed discussion of conditional rates is given in Ref. 5. Of course, 
these definitions are meaningful if X = W or Y = W. Roughly speak- 
ing, Rx ,y;w(A1, Az) is the channel capacity required to transmit X, Y 
and to reproduce it as X, Y to within an average distortion (A1, As) 
when both the transmitter and receiver know W. 

We shall need several properties of the conditional rate-distortion 
function in the sequel. The first is given in Ref. 5. For A 2 0, 


Rx|w(A) = min a Pr {W = w}Rx\wew(Aw), (85) 


where Rx|w-=w(-) is the rate-distortion function calculated for a source 
with outputs x € & with probability distribution Px; w(x|w) (the con- 
ditional probability function for X given W = w). The minimum is 
taken over all sets {A.}wew such that >>.Pr {W = w}Ay S A. 

A second fact of importance is that, say, Rx;w(A) is a continuous, 
convex, nonincreasing function of A for A = 0. That Rx;w(A) is non- 
increasing follows from the definition. The proof that it 1s convex 
parallels the proof of the convexity of the ordinary rate-distortion 
function. The continuity of Rx;w(A), A > 0 follows from its convexity. 
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Finally, the continuity of Rx;w(A) at A = 0 follows from (85), and 
the continuity of Rx;w=u(A) at A = 0. 
A third fact we shall need is that, for any X, Wi, We, A = 0, 


Rxiw,w,(A) S Rxjw,(A). (36) 


This follows from Rxjw,w,(A) = inf I(X;X|W.iW»), where the 
infimum is with respect to test channels q:(4|x, wi, w2) such that 
Edi(X, X) S A. Included in this class of test channels are those that 
are independent of wz, i.e., qi(€|z, wi, We) = Qr(4£|X1, wi). This subclass 
is exactly the class of test channels in the minimization for computing 
Rx \w,(A). 

The final property of conditional rates is stated as a lemma below. 
The proof is given in the appendix. 

Let X € X be a random variable with probability distribution 
Qx(x) = Pr (X = x}, where & is a finite source alphabet. Let X be a 
finite reproducing alphabet and let d(z,#) =>0,x7€ x, #E€ & bea 
distortion function. 

Now let {W,}%-1 be a family of disjoint finite sets and let 
{px(x, w)}%=1 be a family of probability distributions on X X W, such 
that 


Dd pr(x,w) = Qx (a). 
WEWk 


The random pairs (X, W,) are defined by 
Pr {X = 2, W, = w} = px (2, w), rE L,w ew, 


Let Rx\w,(A), A = O be the corresponding conditional rate-distortion 
function. 

Next, set W = >°%-1W;, where >- indicates union of disjoint sets. 
Define the probability distribution on X XW: 


DO aw) = = pala w), for wEWi,1 Sk Sn, 


and let (X, W) be the corresponding random pair with conditional 
rate-distortion function Rx;w(A), A = 0. Clearly, p*(-) is a mixture 
of the n disjoint probability distributions {p,}, with prior probability 
1/n. We now state the lemma. 


V 


Lemma §: For arbitrary {Ax}f=1, Ax 2 0, 


Rxiw (7% 4s) = 
ni 


3 le 


>> Rxiwe(Ax). 
k=1 
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We note here that all the above is meaningful for the case where 
Q(x, y) is a probability density function or Q is an abstract probability 
measure. We need only make the obvious correspondences between 
discrete distributions and more general probability measures and 
replace “minimum” in (31), (33), (84), and (35) by “infimum.” 

We conclude this section by taking a look at the specialization of the 
above to the case where d; = dz = dz, the Hamming distortion defined 
in (1b), and A; = A, = 0. Then 

Rxy(0,0) = H(X,Y), Rx) = H(X), Ry) = H(Y), 
Rxy|w(0,0) = A(X, Y|W), Rxiw(0) = H(X|W), 
Ry\w(0) = H(Y|W), 
where the entropy H(-) and the conditional entropy H(-| -) are defined 
in (2) and (8), respectively. Analogous to the relation 


H(X|Y) + A(Y) = A(X, Y) S W(X) + AY), (37a) 

which holds for this special case, the following is established in Ref. 5 
for the general case: 

Rx jy(Ai) + Ry(A2) S Rxy(Ai, As) S Rx(Ai) + Ry(A2). (87b) 


Further, it is shown in Ref. 5, Corollary 3.2, that the left inequality in 
(37b) holds with equality in some neighborhood of the origin { (A1, As): 
0 S Ay, Ae S x}, provided that 

Q2y >0, lleexvyey, (38a) 
and di, de satisfy 

di(a, £2) > di(z,27) = 0, x # &, 


38b 
do(y, 9) > de(y,y) = 0, y #%#. (38b) 


2.3 Characterization of ®(A:, 4:)—the main result 


We first state two simple theorems that are generalizations of 
Theorems 2 and 3. The proofs are analogous to the proofs of Section I, 
and are therefore omitted. Theorem 6(a) is also called the Pangloss 
bound. 

Theorem 6: If (Ro, Ri, Re) © R(Ai, Az), then 

(a) Rootkit kh, 2 Rry (Ai, A»). 


(6) Ro + Ri 2 Rx(Ai). 
(c) Ro + Re 2 Ry(A2). 
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Theorem 7: The following triples belong to (Ai, Az): 

(A) Ro = Rxy(A1, Az), Ri = Re. = 0. 

(B) Ro _ 0, Ri = Rx(Ai), Rz = Ry(Az). 

It is also possible to generalize Theorem 3(C) and (D), but this must 
await presentation of the main result, which we now give. 

Consider first the case where X, ¥ are finite. Let Q(x, y), x € &, 
y © Y be given. Now let @ be the family of probability functions 
p(x, y, w), where x € X, y €C Y, w CW, W is another finite set, and 


 p(z,y,w) = Q(z,y), tERYEY. (39) 
wEeW 


Thus, @ is exactly as in Section 1.4. Now each p € @ defines three 
discrete random variables X, Y, W in the obvious way. For p € © 
and Aj, A; = 0, define the subset of Euclidean three-space 


Ri” (Ai, A2) = { (Ro, Ri, Re): Ro = I(x, Y; W), 
R22 Rxijw(di), Re 2 Ryw(As)}. (40a) 
Then let 


@* (Ar, Ae) =[ U @® (Ay As) ¥, (40b) 
pce 


where ( )¢ denotes set closure. Since Rx;w(A1) and Ry;w(Az) are con- 
tinuous for Ai, Az = 0, we conclude that ®*(Aj, As) is continuous in 
(Ai, As) according to the Hausdorff set metric. This metric p(Si, S2) 
between two subsets S1, S. of a Euclidean space is defined by 
p(S1, Se) = sup inf |r: — re|| + sup inf |lri — roll, 
71E8 72ESp r2E S82 1E 81 
where ||-|| denotes Euclidean norm. 

If Q is either a density or a probability measure, then ®*(Ai, Ae) 
can be defined in an analogous way. In this more general case, we must 
require that the source has the property that there exists an 4 € , 
9 € Y such that 


Edy(X,£) < ©, EHdx(Y,9) < @. (41) 


If X, ¥ are finite, then (41) is always satisfied. We can now state our 
main result. 


Theorem 8: ®(Ay, Az) = R*(Ay, Ag). 
Remarks: 

(1) Theorem 8 reduces to Theorem 4 when &, ¥ are finite, d; = d» 
= du, and Ai = Ag =.) 


1702.) THE BELL SYSTEM TECHNICAL JOURNAL, NOVEMBER 1974 


(2) If we define @r as in remark (1) following Theorem 4 as the set 
of test channels p;(w|z, y), then @r is in 1-1 correspondence with @. 

(3) Since ®(A1, Az) is convex, Theorem 8 implies that ®*(Ai, As) 
is convex also. 

(4) Since Theorem 1 is valid for ®(Ai, As), the present theorem 
implies that 7'(a), defined in (13) is also given by 


T(a) = OE Y;W) + osfxiw(A1) + a2kyjw(d2)]. (42) 


Thus, from Theorem 1, the lower boundary ®(Ai, 42), and therefore 
®(A;, Az), is determined by T(«) given in (42). 

(5) As in remark (4) after Theorem 4, Theorems 6 and 7 can be 
obtained directly from Theorem 8. The steps parallel those in remark 
(4) and will be omitted. We will, however, give the generalization of 
Theorem 3(C) and (D). We state this as follows. The following triples 
(Ro, Ri, Re) © R(Ay, A): 

(C) Ro = Ry(Az), Ri = Rxx(Ai), Re = 0, 

(D) Ro = Rx(Ai), Ri=0, Re = Ry|x(A2), 
where the random variable Y is defined by the test channel that 
achieves the infimum in Ry(A:2) (assuming that the infimum can be 
achieved; if not, a simple modification is possible), and X is defined 
by the test channel that achieves Rx(A:). In the discrete case, we can 


achieve point (C) as follows. Let p:(9|y) be the test channel that 
achieves I(Y; Y) = Ry(A:). Let W = Y and let 


p(x, y, 9) = Q(z, y)piSly) E ©. 


The random variables X, Y, Y are defined in an obvious way by 
p(x, y, 9). Further, since X, Y are conditionally independent given Y, 


(X,Y; ¥) =1(Y;V) +1(X; PY) 
= 1(Y; VY) = Ry(A,). 
‘Also, the conditional rate Ry|y(A2) = 0. Thus, from Theorem 8, with 
W = Y, we have (Ro, Ri, Re) € R(Ai, Az) where 
Ro = I(X, ¥% W) = Ry (Az) 
Ri = Rxyw(Ai) = Rxi7 (Ar), 
R, = Ry |w(Azg) a 0. 


This is point (C). Point (D) is obtained on reversing the roles of X 
and Y. 
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Since R(Aj, Az) is convex, any linear combination of points (A) and 
_ (B) of Theorem 7 and (C) and (D) above also belongs to ®(Ai, A2). 
But there is no guarantee in this case that points (C) and (D) will lie 
on the Pangloss plane. There are cases for which a portion of the 
Pangloss plane is known to be realizable, as is shown in the example 
below. 


2.4 A technique for overbounding @(A:, A:) 


In this section we present an intuitively sensible ad hoc scheme 
for choosing probability distributions p€@ that yield triples 
[I(X, Y; W), Rx;w(A1), Ry; w(Az) | which are often close to or actually 
on the boundary curve ®(Aj, Az). In fact, in many cases this triple 
will lie on the Pangloss plane. 

A natural coding scheme to apply to our network would be to send 
a “coarse” version of the source output (X, Y) over the common 
channel, and then send to each receiver over its private channel only 
the necessary “fine tuning” it needs to meet its fidelity requirement. 
This reasoning leads us to the following family of rate triples that 
belong to ®(Ai, Ar). Assume for simplicity that x, Y, £, Y are finite. 

Let A1, As 2 O be given. Let 1, Be satisfy 


Bi 2 Ai, Bo 2 Ao. 


Now let q:(%, #|x, y) be the test channel that achieves I(X, Y; X, FY) 
= Ryy(61, B2). Then with W = (X, Y) we have that the triple 
(Ro, Ri, Re) = R(Ai, Ao), where 


Ro = I(x, Va W) = Rxy (61, Be), 
and 
Ri = Rxiz¥(Ai), Re = Ry x¥ (Ao). (43) 


Note that the rates corresponding to Theorem 7(A) and (B) and to 
points (C) and (D) in remark (5) following Theorem 8 can be gen- 
erated as special cases of the rate in (43). We do this as follows: 


A: Let (81, B2) => (Ay, A:). 

B: Let 81, 82 be large enough so that Rxy (61, B2) = 0. Then X, VY are 
degenerate. 

C: Let 81 be large enough so that Rx(@1) = 0, and let 62 = Ae. Then 
X is degenerate. | 

D: Let 62 be large enough so that Ry(62) = 0, and let 6; = Ai. 


The power of this technique is illustrated by the following theorem, 
which asserts that under weak assumption the family of rates given 


1704 =THE BELL SYSTEM TECHNICAL JOURNAL, NOVEMBER 1974 


by (48) includes a substantial subfamily that lies on the Pangloss 
bound and therefore on the boundary. 


Theorem 9: Given a source that satisfies 


(i) c= £, y = ¥, &, Y finite, 
(1) Q@,y)>O0, alxreEexuyey, 
(i127) di(x, £) > di(a, x) = 0, all distinct xz, £C X, and dy, 9) 
> deo(y, y) = 0, all distinct y, 9 € Y. 


Then there exists two neighborhoods of the origin 


ni = {(A1, 42): 0 S An, Ae S a} 
nz = {(B1, 82): 0 S Bi, Bo SO}, 


where 0 < a S b, such that, if (Ai, Ae) € m1 and (B1, Be) € ne, then 
Ro + Ri + Re = Rxy(A, Ao), 


where (Ro, R1, Re) ts given by (43). 

The theorem can be proved using Shannon lower-bound techniques! ® 
and, in particular, the proof is similar to that of Theorem 32 in Ref. 5. 
Since the proof requires the generation of special machinery that is 
only tangential to the main ideas in this paper, we have elected to 
omit it. 


2.5 Examples 


(A) Our first example will be the DSBS considered in the example 
of Section 1.5. Here © = Y = {0,1}, and 


Q(z, y) = 21 — po)dzy + 2p0(1 — bz), ty = 0,1, (44) 


where the parameter po € [0, 3 ]. The distortion function will be the 
Hamming metric, i.e., di = dz = dy, where dy is defined in (1b). 
Again, as in Section 1.4, we consider only the plane in (Ro, Ri, R2)- 
space where R, = Ry and Ai = A, = A. We employ the technique of 
Section 2.4 to obtain an upper bound for ®(A, A). 

Making use of Ref. 1, pp. 46-50 (Ex. 2.7.2), we have 


1+ h(po) — 2h(8), OS BS 
L(1 — po) — 2{L(28 — po) + L[2(1 — 8) — pol}, 
M5853 (45a) 


Rxy(8, 8) = 
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where 


pi = 3 — 3V1 — 2po, (45b) 
h() = A\log\A — (1 — A) log 1 — A), OF) Sl, (45c) 
L(A) = —Alog), 0OsAs1 (45d) 


Now, from Ref. 1, the random variables X and Y, which satisfy 
I(X,Y;XY) = Rxy(@, 8) are such that 


Pr {X =2/X = %, Y = 9} = Pr {X = 2|X = ¥} 

= (1 — B)é.,2 + B(1 — 62,2), 2,%, 9% = 0,1 (46a) 
and 
Pr {¥ = y|X =£;¥ = 5} = Pr {¥ =y|Y = 5} 

= (1 — B)éy,5 + B(L — 6,5), ¥,9,% = 0,1. (46d) 
Thus, again from Ref. 1 (p. 46, Ex. 2.7.1), forO S$ A S 8, 


Rx\z7(4) = Rx\x(A) = Ry\x7(4) = RyF(A) 
= h(8) — h(A). (47) 


Thus, we conclude that, for arbitrary 0 S$ A S B S 3, the triple 
(Ro, Ri, Re) = R(A, A), where Ro = Rxy(8, 8) [as in (45) ], and Ri 
= R, = h(®) — h(A). Let us note that, for 0 S A S B S pu, these 
rate-triples (Ro, Ri, Rz) satisfy 


Ro + R; + R, =] + h(po) = 2h(A) = Rxy(A, A), (48) 


and therefore lie on the Pangloss plane and ®(A, A). One special case 
occurs when A = 0, 6 = 7;. This yields the rate-triple of (24)—.e., 
point G in Fig. 4. In fact, the distribution p(z, y, w) € ®, which we 
guessed at in Section 1.5, was obtained by setting W = (X, Y), where 
X, Y are as above for B = 71. 

(B) Our second example is a source where Q(z, y) is a density func- 
tion and x, £, Y, Y are the reals. The ad hoc technique used in the 
previous example (A) will work here with obvious modifications. The 
random variables X, Y in this case will be jointly gaussian with EX 
= HY =0, EX? = EY? =1, and EXY =r, OSrs1. Thus, the 
density 

at 2 — rx 

O09) = sap exp | SEY. cas) 
We take the distortion to be di(-, -) = de(-, -), where 
d(x, 2) = (@ — #4), -—-~m <4, < @. 
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For 0 < 8 < «, it can be shown! that 


1 1-—?r 

sloe( 5" J, 0S#<1-+, 
Rxy(6, 8) = 41 Ler (50) 

ple (aaa) Lars BS 1, 

0, p= tl. 


Further, the random variables X, Y which satisfy I[(X, Y;X, Y) 
= Rxy(8,8),0<BS 1 are such that, given X = x, Y = 9, X and Y 
are gaussian with 


E(X|X =%, Y = 5) = &, 
E(Y|X =%, Y = 5) = 3, 
var (X|X = %, Y = 7) = var (Y|X =%, Y= 7) =8 


Thus,!* for0 << AS 6 <1, 

8. 

A 

Thus, we conclude that, for arbitrary 0 < A S 6 S11, the triple 
(Ro, Ri, Re) € R(A, A), where Ro = Rxry(8, 8B) [as in (50)] and 
Ri = R. = 3 log B/A. Again, observe that for OS AS BS1-—r, 


t=? 
x) = Berl), GD 


1 
Rx x7(A) = Ry\x7(A) = 5 log 





Ro + Ri + Ra = 5 log ( 


and therefore (Ro, Ri, Re) lies on the Pangloss plane and therefore on 
R(A, A). 


iil. PROOF OF THE MAIN RESULT—THEOREM 8 


The proof of Theorem 8 consists of two parts: (7) the “converse” 
part, which asserts that any point in (Aj, Az) belongs to ®*(Aj, Ae) 
and (72) the ‘direct’ or “positive” part, which asserts that any point 
In R*(Ai, Az) belongs to R(A1, Ae). We give the proof for the case where 
x, Y are finite sets. The proof for arbitrary 2X, Y follows in a parallel 
way with integrals replacing sums, etc., in the standard way. We will 
begin with the converse. 


3.1 The converse 


Let (fz, fS°, f°) define a code (n, Mo, M1, Mo, Ax, Ay). We find 
a p*(x;y, w) € @& for an appropriate set W such that 


( joe M,, : joe Mt, * ine Ms) E MO (Ax, Ay) C @*(Ax, Ay). (52) 


SOURCE CODING 1707 


The converse follows on applying the definition of (Ai, A2)-achievable- 
rates and applying the continuity of ®* as discussed in Section II. 

First, let fz(X, Y) = (So, Si, S2) where S; € Im, is a random vari- 
able (¢ = 0, 1, 2). Then we have 


1 w oA (3) J 
=} kon [H (Xs, Vs) — H(Xs, Ye|So, Xz, ++) Xeay Py oy Yea) 


(53) 
These steps are justified as follows: 


(1) From So € Im. 

(2) Standard inequality. 

(8) Definition of J(X, Y; So). 

(4) H(X, Y) = >°.H(X;, Yx) follows from the independence of the 
pairs (X;, Y,), k = 1, 2, ---, n. The rest is also a standard identity. 


Now, for 1 Sk Sn, let Wi = (So, X1, °°, Xe-1, Vi, +++, Vu-1), a 
random variable belonging to, say, Wz, a finite set.‘ Relation (53) is 
then 

“log Me 2 = Ys U(X Yes Wa). (54) 


Next, let X = fofxz(X, Y). Let Aw = Edi(Xi,X,), 1 Sk Sn. 
Then 


4 1 
Ay = EDX, X) = a Pa Aik. (55a) 


We now write 


I @ 1 “ 1 ~ 
7, oa Ma = 7 F&I So) = 7, 1 (XI So) 


(3) 1 x 

= n pa I(XnjX|Go, Xi, vee, Xy-1) 

(4) { on 

= poe, 1 (Kus Xe | So, Xi, +++, Xx-1) 

(5) 1 2 

= ae a Rexy,iv; (Aix) 2 = 4 = > Rxiwwi (Aix), (55b) 


where V; = (So, Xi, eee ig X x1); and W; = (Vi, Y1, trees Y,-1) as 
above. These steps are justified as follows: 


t We can, of course, take Wz = Im, X NX & Yr, 
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(1) Since X is a function of Sy and S;, we have that, conditioned on 
So = So, X can take no more than M, values (since S1 € Iy,). Thus, 
H(X|So = s0) S$ log Mi, all so. 

(2-4) Standard inequalities and identities. 

(5) From the definition of Bixiivx (Ari), since Ed,(Xz, Xz) = Axx. 

(6) Follows from (36). 


A similar derivation yields 


1 | ees 
7, og Ma 2 pa Ry, w,(Aze), (56a) 
where 
= > Ak. (56b) 


We are now ina position to define p*(z, y, w). With W;, defined as 
above, let 
pelt, y, w) = Pr {X,; = 2, Yi = y, Wi = wv}, 
Ee 2,7 SU, we We 
Let 
Pir(x, w) = 2X pelt, y, w), 


P2k (y, w) = Xu pxr(z, Y; w) 


be the marginal distributions for (Xz, W;) and (Yx, Wx), respectively. 
The {W;} can be considered a class of disjoint sets. Let W = DW, 
and define the probability function on © K Y K ‘W 


p* (x, Yy; w) = * pela, Y) w), WwW € W ky 1 mS k < n. 
Since 
id 1 
yD p (a, Y) w) = > > — px(2, Y; w) = Q(z, Y), 
wew k=1 wew, 2 
we have p* € @. The random variables X, Y, W are defined by p* in 
the obvious way. We can think of W as being generated by choosing an 


integer K € [1, n] without bias, and setting W = W; when K = k, 
1s k Sn.A straightforward calculation yields 


1(X,Y;W) = ~¥ UK, Y;W,). (57a) 
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Furthermore, Lemma 5 can be applied to pix, pez to yield 


Rxiw(Ax) S 5 ¥ Reswi(da) (57b) 
Ryiw(Ay) S = Ryywy(A). (576) 


Inequalities (57a, b, c) can be substituted into (54), (55), (56), re- 
spectively, to obtain 


* log M, = 1(X, Y;W) 
1 
7, 108 M, = Rx) w(Ax) 


= log M, 2 Ry\w(Ay), 


which is (52). This completes the proof of the converse. 


3.2 The direct half 


We begin the proof by stating:a lemma concerning conventional 
source coding for a single memoryless source. The source is defined by 
a random variable X € &X, with probability distributions Qx(x), and 
a reproducing alphabet & with distortion function dj(z, 2). As 
above, X = (X1,---, Xn) are n independent copies of X. Let Q(x) 
= Tlf.,Qx(x) be the probability distribution for X. Let R(A) be the 
rate-distortion function. 

A source code with parameters (n,M) may be thought of as a 
mapping f: X"—> © C &*, where card @ S$ M. Let X = (Xi, -++, Xn) 
= f(X). Then D,(X, X) = 1/nX08-1d:(Xx, X;) is a random variable. 
We are interested in the quantity 


T(A; + 6) = Pr {D,(K, X) = A+ 8} = x, OF PCR), (58) 
xGx" 


where (x) = 1, if D[x, f(x)] = A +6, and (x) = 0, otherwise. We 
now state a lemma, which follows immediately from Lemma 9.3.1 and 
inequality (9.3.31) of Gallager.* 


Lemma 10: Let A = 0, and «, 6 > 0 be arbitrary. Then there exist A, 
B > 0 such that for all n = 1, 2, --- there exists a code with parameters 
(n, M) satisfying 
M S 2rtk@)+4, 
and 
T(A + 8) = Pr {D,(X, X) = A+ 8} < Ae2. 
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With the aid of this lemma the standard source coding theorem 
follows readily (Ref. 4, Theorem 9.3.1). 

Next, let us consider a compound source for which the source output 
in 7 time units is an n-vector X = (Xi, ---, Xn) € XL". The {X;} are 
still independent, but the X;, are not identically distributed. 

Let m1, m2, +++, nz be such that >oj.yn; =n, and let Q:(-), 
Q2(-), --:, Qz(-) be J probability distribution functions on X. The 
source is characterized by the fact that a known subset n,; of the n 
coordinates of X are distributed according to Q;(-), 7 = 1, --:, J. 
Let R;(A) be the rate-distortion function corresponding to Q;(-) rela- 
tive to the distortion function di. A code is defined exactly as above, 
and X = f(X). We now have 


Corollary 11: Let A; 20,7 = 1,2, --+-,J, and «, 6 > 0 be arbitrary. 
Then there exist A;,B; > 0,7 = 1, --+, J, such that, for alln = 1,2,--- 


and any set {n;}{ such that En; = n, there exists a code with parameter 
M satisfying 


Ms Tl expe {njLR;(A;) + €]} = expe {> njLR;(A;) + €]} (59a) 


and 


V 


, yd 
T(A + 6) = Pr {D,(X, X) = A+ 6} S ¥ A;Q-Fimi, (59) 
7=0 
where A = n)) n;A;. The (Aj, B;)’s are the (A, B) of Lemma 10 corre- 
sponding to Q;(-). 
The corollary follows immediately from Lemma 10 on noting that, 
for any random variables {U;} and any set of constants {c;}, 


Prijdy Uy 2 26} Sy Priv; =] Gi}. 


Let us also remark that the Q‘” (x) used to compute I'(A + 6) in the 
corollary is of the form 


QP (x) = TL Qrleey) TE Qa): TE Qr@n), (60) 


where the 7;,th coordinate of x has distribution Q;(-), 1 Sk S n,, 
OS; SJ = 

Let us now turn to our network coding problem. An alternative 
(though equivalent) way of defining a code for our network with 
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parameters (n, Mo, M1, M2) is 


(1) A mapping 
g: X*" XK y* > &, (61a) 


where © is an arbitrary set with cardinality <M». The mapping g is 
called a ‘‘core code.” 
(2) For each w € @, a mapping 


gP: 0" X Yyr— CM C Kr, (61b) 


where card C& < M,. 
(3) For each w € @, a mapping 


gQ: 07 X yr CEH CK Y, (61c) 
where card C& < Mz. 


The code defined in this way can be used on our network (Fig. 2) as 
follows. Let © = {w:}i”°. Then, if g(x, y) = w:, the index 7 is trans- 
mitted over the common channel. Let C& = {%,}#4,, 1 S75 Mo. 
Then, if g(x, y) = w:, and g(x, y) = Xi, we transmit the index 1 
over the private channel to receiver 1. The decoder at receiver 1, 
knowing the indices 7 and 1, emits X;., and the resulting distortion is 
Dy, (x, Xi). Receiver 2 works analogously. 

Let us fix our attention on receiver 1, and assume that g(x, y) 
= g(x). Then we define the quantity q(x, w,:)(x € %", w: € ©) as 
the probability that KX = x € 9X" and Y = y such that g(x, y) = wi. 
Thus, 

q(x, W:) = oe Q@m (x, y). : (62) 


y:9 (x,y) =wi 


Q™ (x, y) 4 []#=19 (2x, yz) is the probability distribution for X and Y. 
Then, as in (58) with X = g@(X), W = g(X, Y), 


T(Ai + 6) 2 Pr {Di(X, XK) > Ai.+ 6} = sap q(x, W:)®;(x), (63a) 


where 
fs 1, if Dix, gw: (x) J > Ai + 6, 
#.(x) 0, otherwise. (63b) 
Substituting (62) into (63), we obtain 
Mo 
Tii+d= Vl YL QQ, y)&i(x)}, (64a) 


i=l (zp WCC 
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where 
G: = {(@, y):9@, y) =wi}, 1Sis Mo. (64b) 


Now our goal is to show that there exists a code for n sufficiently large, 
with Mo, M;, M, appropriately chosen, and with !'(A; + 5) arbitrarily 
small. 

Let us assume that we are given a probability distribution p(z, y, w) 
€ @, where xEX, yEY, wEW. Let pw(w) = Lez, y, w), 
w €W be the marginal distribution of W. Assume with no loss of 
generality that pw(w) > 0. Let 


p(x, y, w) 
pw(w) 


be the ‘‘backward test channel.” For x € &", y€ y", w CW’, let 


po(z, y|w) = 


p™ (x, y; Ww) = e=1p (x, Yk, Wk) 


be the probability distribution for X, Y, W (n independent drawings 
of X, Y, W). Let pW (w) = [[z-:pw(wx), and p§(x, y|w) 
zs Tli=1o(z2, Yyx| We). For (x,y, w) € ©" X Yy" XW", let 


ps(x,y|w) Do(Ley Yx| We) 
OG, 8 OG, yn) 7? 


be the information “density.”’ Of course, 


Ei (X,Y; W) = 1{X, ¥;W} = nl {X,Y;W}. 


i) (x, y;w) = log 


Finally, let A; = 0 be given and let {A.}wew satisfy 


Rxiw(Ai) = DX pw(w)Rxw-w(Aw), (66a) 
weEW 
and 
Ar= D> pw(w)dAv. (66b) 
weWw 


See (35). A similar expression can be written for Ry;w(Ag). 

We now return to our network coding problem. With p € @ given, 
we set out to construct a core code g with certain desirable properties. 
For any core code g: X* X Y* > © = {w,.}i”? Cw, let Nia = the 
number of occurrences of symbol w in code vector w;, 1 SiS Mo, 
w € W. The existence of a desirable core code is assured by 


Lemma 12: Let p © © and «€ > 0 be arbitrary. Let I* = I(X, Y; W) 
correspond to p € @. For n sufficiently large, there exists a code g as in 


SOURCE CODING 1713 


(61a) such that 
(t) My S 22+e) 





(di) Nee py = qininae oy Joral we ©, 
(iit) Pr(Sp= DY QM (x,y) S « 
ages 
where 


1. 
Se = {(x, y):51Lx, y;9(a,y)] 2 I* — ¢}, 


and t(x, y; w) ts defined in (68). 

We defer discussion on the proof of Lemma 12 to the end of this 
section. 

Let us suppose that g is a code that satisfies conditions (2), (7), 
and (iz) of Lemma 12. Let {g&P}i" be a family of encoders as in 
(61b). Consider expression (64a). The term in braces is 


Xu Q(x, y) B(x) 


(x,y) GG 


= LQ (x, y) Bix) + e Q‘™ (x,y). (67) 
(x,y) EGiNs, (x,y) EGiNS, 
But if (x, y) € G; [ie., g(x, y) = w:)] and (x, y) € S,, then 
Q™ (x, y) S 2-9 ap” (x, y| wi), 


so that the first summation in the right member of (67) can be over- 
bounded: 


S2ron | pl (x, yl we)Bi(x) 
(xy): g(x,y) =wi 
(x,y) ES. 


SQ on YT ys (x, y) | ws) s(x) (68) 
x,y 
S2-(P en S yf” (x| ws). (x). 
Combining (68), (67), and (64), we have 
T(Ay + 5) = Pr {Di(x, x) > Ai+ 5} 


Mo Mo 
cea = pM(x|w)e(x) +O + Q™ (x, y) 


t=1 (x,y) EGiNS’ 


Mo 
S2-( on SY ps” (x| wi) s(x) + Pr (S39. (69) 
i=l x 
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Now consider 
ps” (x| Wi) = i po(rx| Wee), 


where wi = (wi, Wi2, +++, Win), 1 St S Mo. With Niw as defined just 
above Lemma 12, we see that (for a given 7) p{” (x|w:) is the same form 
as Q(x) in (60) with n; = Ni». It then follows from Corollary 11 
that, with w; held fixed, we can find a source code for X—i.e., a 
mapping g*’/—with parameter M = M, such that, for arbitrary 


e, 6 > 0, 


M, Ss eXpe2 { 2 NewlRxirew(Av) + ej}, (70a) 
ve 
and 
DX ps” (x[ wa) O(x) S DL Ay2-3eMiw, (70b) 


where ®;(x) is defined in (63b) with Ai = n™7YiuNiwAw, and {Ay} 
satisfying (66). The {(A., Bw)} correspond to p,(x|w). Further, since 
the {Ni} satisfy condition (77) of Lemma 12, (70) becomes [using 
(66) ] 


M, S expe {n ne (pwlw] + «)(Rxiw=ul Aw] + ¢)} 
S expe {n(Rx;wlAs] + H[X]+ 6} (71a) 


and 


S pf? (x|w.)bs(x) SX AyQ-Bwnrwlwt-o < CQ-"B0-9, (71) 
x vEew 


where C = (card W)-maxy,Ay and B = minyBupu(w). Substituting 
(71b) into (69) and using conditions (2) and (777) of Lemma 12, we have 


T(A, + 6) S 2-"4*-9) My-C2Q-7BG-9) + Pr (S89) 
S2-"(B-Be-2e) 4 ¢€ +0, asn— o and then e— 0. 


Since we can do an identical construction for Y, we have proved 


Lemma 18: Let p€@, and let the corresponding information be 
I(x, Y;W) = I*. Let Ay, As 2 0 and «, 6 > O be arbitrary. Then, for 
n sufficiently large, there exists a coding scheme as in (61) with param- 
eters (n, Mo, Mi, Me) such that 


(4) My S 2049), 
(ti) My S 2n(Rxiwanto, 
(iii) Mz S Qn(Ryiw(a)+0), 
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and 


(iv) Pr {Di(X, X) > Ar + 8} S «, 
(v) Pr {D.(¥Y, Y) > 42+ 6} Se. 


The following corollary follows from Lemma 13 in the usual way 
(exactly as does Theorem 9.3.1 in Gallager‘). 


Corollary 14: Let p€o@, and let the corresponding information 
I(X, VY; W) =I1*. Then, for arbitrary Ai, Az = 0, the rate-triple 
(a%, Rx w(Ai), Ry, w(Az) | 1s (Aj, A2)-achievable. Thus, RP) (Ay, Ae) 
C R(Ay, Az), for all p € @. 

The direct-half now follows on noting that, if S; CS, and Sz is 
closed, then the closure Sf C S2. Thus, ®* (Ai, As) = [U,R) (Aj, As) Je 
C ®R(Ai, Az), which is what we had to prove. 

It remains to prove Lemma 12. Since the proof is nearly identical 
to that of Lemma 9.3.1 in Gallager,* we will only outline the steps. 
Let ¢€ > 0 be arbitrary. For w € W”, let N..(W) = the number of 
occurrences of symbol w € W in the n-vector W. Then define 


Nw (9) 


T(e) = | #E wes all w Ew, — Dw(w )| < ¢ 


Then, paralleling aBED there exists a mapping g [as in (6la)] for 
which 
Mos < Qn(I*+e) 


and 
Pr | * DR, ¥5 9(K, Y]<sl—c o 9 X%NE To} 
P,(A) + exp {er} © £(n), 


IIA 


where e« > 0 is arbitrary and 
= | &% Y, W): either + i (X, ¥;W) > I* + cor 
* i (K, ¥; W) <i*—ecorWe 7(0| : 
and P,(-) is probability computed with respect to p(x, y, w) € @. By the 
weak law of large numbers, if e, < ¢, then é, > 0, asn— . 


Let the code whose existence we have just asserted be {w,}{"°. There 
must be at least one code vector, say, W1, which belongs to Te). Now 


Pr {g(%, Y) € T()} S &(n). 
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If x, y are such that g(x, y) € T(e), change g(x, y) to wi. The new code 
has g(x, y) © T(e) and 


Pr * jOLX, Y;9(X, Y)] $ I* — (| < 2¢(n) > 0. 
Thus, this new code satisfies conditions (7), (77), and (777) of Lemma 12. 
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APPENDIX 
A.1 Proof of the convexity of R(A:, A:) 
Let Ai, Az be given and held fixed. Write R(Aj, Az) as R. 


Theorem 15: ® ts convex. 


Proof: The theorem follows by a ‘‘time-sharing” argument. Let R®, 
R® € @, and 0 S 6 S 1. We must show that 


R ea R, (72a) 
R = 6R® + (1 — #)R®, (72b) 


Let (gz, 98, 98) and (hz, hS, A$) be codes with parameters 
(nr, MP, ME, MP, AY, AP) and (nm, MP, MP, MP, AY, AP), re- 
spectively, whist AQ, AP S Ai, AY’, AP S Ao. Say 6 = A/B, where 
A, B are integers,O S$ A <B < 0, We show how to construct a code 
. Ge, Mo, M1, M2, Ax, Ay), where 


* log M; = 0 ( = log MP ) + (1 —- 0)( = log MP), (73) 
1 


(c = 0,1,2), and Ax S Ai, Ay S Av. This will establish (72) for 
rational 6. Since the region @ is closed, (72) must hold for all 6, estab- 
lishing Theorem 15. 

We now define a code with block length n = cn + dv, where 
c = An, d = (B — A)ni. Let (x,y) © XL" X Y” be a sequence of n 
pairs. Partition this sequence into c blocks of n; pairs and d blocks of 
M2 pairs. Encode-decode the first c blocks using encoder-decoder 
(gz, 9S, gS), and encode-decode the remaining d blocks using encoder- 
decoder (hz, h§, h$?). Denote this combination encoder-decoder 
by (fz, fS, 2), Consider fz(x, y) = (So, Si, Se). The quantity 
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S:(¢ = 0, 1, 2) takes values in a set with 
(MP)>. (MP) SM 


members. This set can, of course, be put in 1-1 correspondence with 
Iy,. Thus, for 7 = 0, 1, 2, 


* log M; = “log MY + “Nog MP = = log MP + —— (l —*) log M?, 


which is (73). Further, the new code has Ax S Ay, and Ay S Az, so 
that the lemma follows. 


A.2 Proof of Theorem 1 

Again let A;, Ai 2 0 be given and fixed, and write ®(Ai, Az) = &, 
and ®(Ay, Ax) = &. 

We first establish part (77) of the theorem. Let R € S(a), «a € @’. 
If RE @, then there exists an R = (Ro, Ri, R2) © ®, such that 
R; SR; 1 =0, 1, 2, and at least one of these inequalities holds 
strictly. Thus, 


C(a, R) — C(e, R) = (Ro — Ro) + a1(Ri — Ri) 
+ a(R, — Rs) <0. (74) 


The inequality follows from a1, a2 > 0. This contradicts R € S(a). 
Thus R € @, which establishes part (72). 

It remains to establish part (2). We must first obtain some pre- 
liminary facts. 


Lemma 16: Let (Ro, Ri, Re) € @. Then 


(a) for a; 20 (@=0,1,2), (Ro + ao, Ri + a1, Re + ae) € @, 
(}) for OS681, [(1 — 6)Ro, Ri + ORo, Re + ORo] € &, 
(c) for OS 61,6. $1, 

[Ro + 0:Ri + O2Re, (1 — 61)Ri, (1 — 62)Re] € A. 


IA_ IIA 


Proof: 


(a) follows immediately from the definition of @. 


(b) follows on noting that data sent through the common channel 
can be transmitted instead through each private channel. 


(c) follows on noting that any data transmitted through either 
private channel can be transmitted instead over the common channel. 
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Next, for Ri, Re = 0 write r = (Ri, R2), and define the function 


F(r) = F(R, R2z) = min{Ro: (Ro, Ri, R2) = R}. (75) 


The minimum exists because ® is closed. Clearly, (Ro, Ri, Re) € ® 
only if Ryo = F(R, R2). 


Lemma 17: F(r) is convex. 


Proof: Letr™,r® be arbitrary. Then [F(r™), r© ], [F(r™), r@] © @. 
Since ® is convex, for0 $ 6 S 1, 


OCF (r), re] + (1 — a [F(r®), r@] 
= foF(r™) + (1 — *F(r), 6 + (1 — #)r@] Ea. 
Thus, by the definition of F(-), 
F{er{? + (1 — &—)r®] Ss OF (r™) + (1 — O)F(r™), 


establishing the lemma. 
Now it follows from the convexity of F(-) that, for arbitrary 
r* = (Ri, R2), RiRz = 0, there exists constants a; = a;(r*), 7 = 1, 2, 


such that, for all r, 


F(t) — F(r*) = 3 «:(R* — R,). (76) 


This is a statement of the well-known fact that any convex curve lies 
above a plane of support. Here the curve is the locus of points in 
(Ro, Ri, Re)-space given by Ro = F(r) = F(R, Re), and the plane is 
the locus of points Ry = F(r*) + 30?-10:(Ri — R,). Note that the curve 
and the plane coincide at r = r™*. 

Now let R* = (Ro, Ri, R2) € R. Then Rj = F(Ri, R2). Let R = 
(Ro, Ri, Re) be any triple in ®. Then with r = (Ri, Re), (76) yields 


F(r) + aR + ak, = Ro + aki + a2R3. 
Since, by definition of F(-), F(r) S Ro, we have: 
Ro + aki + ok; = min (Ro + aR: + a2Re) 


= min C(a,R) = T(e), 
RER 
where C'(a, R) and 7'(a) are defined by (12) and (13), respectively. 
Thus, we have shown that, if the triple R* € &, then R* € S(a), 


where a need not necessarily belong to @. 


SOURCE CODING 1719 


Now suppose that R* = (Ro, Ri, R3) € a, and R* € S(a) where, © | 
say, a1 < 0. From Lemma 16(a) (with a > 0), R = (Ro, Ri + a, R3) 
© @, and 


C(a, R) < C(a, R*), 


which implies R* € S(@), a contradiction. Thus, a; (and similarly 
a2) 2 0. Next, suppose R*E ®, and R* € S(a) where, say, a1 > 1. 
Then, from Lemma 16(c), R = (Rj + Ri, 0, R23) € R, and 


C(a, R) < C(a, R*), 


again a contradiction. Thus, a1, and similarly a. < 1. Finally, suppose 


that R* € ® and R* € S(a), where a1 + a2 < 1. By Lemma 16(b), 
R = (0, Ri + Ro, R2 + Ro) € @, and 


C(a, R) = C(a, R*), 
a contradiction. Thus, a, + a, 2 1. We conclude that 


RE VU S8(a), (77) 
aca 


where @ is the set of a = (a1, a2) that satisfy 0 S a1, ag S$ 1, and 
a1 + a, = 1. This is part (7). This completes the proof of Theorem 1. 


A.3 Proof of Lemma 5 


Let {A,}% be given, and, fork = 1, 2, ---, n, let qu(4|2, w), 2 € &, 
w © W, be a test channel for which 
»~ 2 d(x s£)qen (4| x, w) p(x, w) = Ax, (78a) 
w EW, 2,z 
and . 
I(X;X|W:.) S Rxyw,(Ad) + 6 (78b) 


where ¢€ > 0 is arbitrary. For w GW = DfiiW., rE xX, EC X, 
define the test channel 


Gi(4|z, w) = qux(£|z,w), forweWi1lsk sn. 


Then 


, d(z, £)q2(& | az, w)p* (x, w) 


3i— 


= © EE 7M ManlEle, w)pe@, w) s 


> Ag 
k=1 wews k=1 
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Thus, corresponding to the distribution p*(z, w)-qi(4|2, w), 
1(X, X|W) = Rxin( >2 as): (79) 
However, by a straightforward calculation, 
U(X, £|W) = 2 ¥ 1%, £1) s = 2, Rxim(Ak) + « (80) 


The inequality follows from (78b). Combining (79) and (80) and letting 
e— 0, we have Lemma 5. 
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When a suitable interfacial dopant, such as W, is introduced at the 
interface between the dielectrics of a DDC cell, the write-erase character- 
astics of the cell are greatly improved. The useful range of the dopant con- 
centration is determined to lie between about 10“ to 10% atoms/cm?. The 
interfacial dopant allows the fabrication of a DDC cell with relatively 
thick SiO. layers (>50 A). The result is a substantially permanent memory 
cell that can still be subjected to electrical write-erase at reasonable gate- 
voltage conditions. 


I. INTRODUCTION 


There has been an increasing interest in recent months in the dual 
dielectric metal insulator semiconductor (MIS) cell as a nonvolatile 
semiconductor memory element. A true nonvolatile semiconductor 
memory could replace the omnipresent magnetic memory because 
associated with it one also expects fast access capability as well as 
interface compatibility with other semiconductor logic circuits. Key 
features in such a semiconductor memory cell, then, are: true non- 
volatility, high-speed access capability, and ease of write-erase opera- 
tions. The development of the dual-dielectric charge-storage (DDC) 
cells have followed two parallel paths, both enjoying a limited success 
yielding commercial products. The first centers around the concept of 
the floating gate,’ an artificially created metallic charge-storage site 
located at the dual dielectric interface. The second uses the naturally 
occurring interfacial states existing at the dual-dielectric interface as 
the charge storage sites, as in metal-nitride-oxide-semiconductor 
(MNOS) memory transistors.? The advantages and disadvantages of 
these two approaches to the realization of DDC cells is reviewed 
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briefly. And then the concept of the interfacial dopants, the heart of 
this paper, emerges as a particularly beneficial compromise between 
these two concepts, resulting in an optimum DDC cell, with true non- 
volatility, yet with undemanding write-erase conditions. 


1.1 Floating-gate concept 


The early DDC cells were built around a floating gate introduced as 
the charge-storage site located between the two. dissimilar dielectric 
layers. The charge exchange between the floating gate and Si in this 
structure was achieved via electron tunnelling through a thin SiO, layer 
grown on the Si substrate. The advantage in the floating-gate concept 
lies in that the potential well can be tailored to the write-erase needs. 
In practice, a difficulty was encountered in achieving true nonvolatility 
in these cells, at least in large-volume-memory situations, because any 
single shorting path between the floating gate and either the Si sub- 
strate or the external gate was sufficient to cause rapid loss of the entire 
charge on the floating gate. It has been suggested*® that this short- 
coming may be overcome by replacing the continuous-metal floating 
gate with mutually isolated small metal islands separated by dis- 
tances shorter than the Debye length at the Si surface. Since then, 
attempts have been made?‘ to fabricate and characterize dual-dielectric 
MIS cells with metal islands at the dielectric-layer interface. In general, 
the nonvolatility has not been exceptionally good (the retention time 
being on the order of several hours) presumably because the existence 
of the metal islands have. degraded the quality of the dielectric layers, 
leading to higher leakage. The metal islands are also expected to give 
rise to a field-enhancement effect, causing more rapid stored-charge 
decay. . 

A more successful approach® to achieve true nonvolatility has been 
to insert a continuous floating gate between thick (approximately 
1000 A) SiO. layers. However, due to its large oxide thickness, this 
structure would require prohibitively large gate voltages to effect 
write-erase if tunnelling were to be used. Thus, injection of electrons 
from Si into the floating gate is accomplished by use of hot electrons 
created by biasing a nearby pn junction into avalanche.® This approach, 
although it gives rise to excellent nonvolatility, suffers from the lack of 
a ready electrical means to eject electrons from the floating gate. The 
erase operation is only possible either by thermal means or by ultra- 
violet irradiation, thus severely limiting the versatility of these de- 
vices. Although some arrangements which permit electrical erasing 
have been proposed and tested,’ they usually require increased com- 
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plexity in the cell structure as well as high gate voltages, and therefore 
do not appear practical at present. 


1.2 MNOS cells 


Another DDC cell structure uses the naturally occurring interface 
states at the dividing planes of the two dissimilar dielectric layers. 
The structure commonly referred to as the MNOS cell is the outgrowth 
of Si;N,-SiO.-Si MIS studies carried out initially by Szedon et al.’ 
Since then, there has been a considerable amount of work done in this 
area in the past few years, culminating in experimental memories as 
large as 2048 bits.® It has been well established that ease of write-erase 
operations in these devices requires use of very thin (a little less than 
30 A) SiO. layers on Si so that the charge exchange between the 
interfacial storage sites and Si may proceed via direct tunnelling across 
the SiO, layers. 

Figure la, a schematic energy-band diagram of an MNOS structure 
under bias, illustrates the direct tunnelling mechanism. Electrons from 
the Si conduction band tunnel through the energy barrier associated 
with the thin oxide layer to final states at the oxide-nitride interface. 
Some of these electrons may further penetrate into the nitride layer by 
hopping along the bulk traps. However, the majority of them are 
expected to reside at or near the dual-dielectric interface. If the oxide 
thickness is larger than about 30 A, an appreciable current flow is 
possible only when the electrons originating from the Si conduction 
band tunnel to the conduction band of the oxide and then proceed to 
the dual-dielectric interface (as in the Fowler-Nordheim tunnelling in 
Fig. 1b). In this case, however, most of the electrons reach the nitride | 
as hot electrons and only a small fraction of them are captured by the 
interface states. Since the Fowler-Nordheim tunnelling probability is 
significantly smaller than the direct tunnelling case at a given field, 
a higher field must be applied to induce a comparable current density. 
This would raise the field in the nitride and further impede electron 
trapping in the nitride. It is clear from the above that the conventional 
MNOS cells are expected to work well only with thin oxide layers. A 
similar argument is expected to hold true in the case of hole tunnelling 
in MNOS structures, although the discussion so far has been given in 
terms of electron tunnelling for simplicity. 

A major drawback in the MOS cells is, however, a direct consequence 
of the thin SiO, layer. The nonvolatility is limited by charge leakage 
through the thin SiO» layer by direct back-tunnelling. Although some 
claims have been made that nonvolatility of 10 to 100 years is possible 
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(a) Dirset tunnelling of electrons. (b) Fowler-Nordheim tunnelling of electrons. 


with MNOS structures, it appears that this feat is only achievable by 
use of thicker SiO, layers with the ensuing penalty in write-erase gate- 
voltage pulse both in height and width. The MNOS structures operable 
with a short write-erase time, say in l-us range, appear to have a non- 
volatility usually considerably less than 1 year. 

Outer dielectric layers other than SisN, have also been used in dual- 
dielectric charge-storage cells. A notable example is the use of Al2Os. 
In 1970, Nakanuma et al.,!° for example, showed that Al,O3 is a suitable 
outer layer. The nonvolatility of these cells is again critically dependent 
on SiO; layer thickness. Cells with SiO, layers about 100 A in thickness 
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were fabricated and showed good charge retention. However, their 
write-erase gate voltages had to be 50 volts or more and of 1 ms or so 
in duration. Also, the ejection of electrons from the interfacial states 
during the erase operation appears to be rather sluggish, possibly 
because the density of states is not high enough and/or the barrier 
height appears to be too high. Some attempts to increase the interfacial 
state density by, for example, varying the Al,O; deposition conditions 
appear to result in a lack of reproducibility and/or loss of good non- 
volatility. 


1.3 Interfacial dopants 


It is apparent from the foregoing that while a true nonvolatility in a 
dual-dielectric cell can only be expected with a structure utilizing rela- 
tively thick (greater than 50 A) SiO, layers, this has led to write-erase 
operations requiring high voltage and/or long pulses. These problems 
have arisen from the fact that the naturally occurring interfacial 
states are not really ideal for the various roles they have to play. 
Attempts to increase their density, which allows shorter time for 
write-erase, usually result in lossy outer dielectric layers. Although 
no specific description is available in the literature, past attempts in 
this regard appear to be comprised of variations in outer-layer-deposi- 
tion conditions to achieve a certain degree of off-stoichiometric outer 
layers, at least in the vicinity of the interface. 

It is reasonable to expect that if a method for controlling interfacial 
states without degrading the dielectric properties of the dual-dielectric 
layers were available, one might overcome the aforementioned diffi- 
culties. We wish to show in this paper that, indeed, such a means has 
been found. Interfacial states induced by certain dopants deliberately 
located at the interface do have beneficial effects such that considerable 
improvement in write-erase operations have resulted in conjunction 
with the use of relatively thick (greater than 50 A) SiO, layers, which 
are essential for nonvolatility. 

Interfacial dopant-induced states of a sufficient density will increase 
the capture probability of the incoming electrons. This is schematically 
illustrated in Fig. 2. Furthermore, the energy levels associated with 
these dopant-induced states will depend on the choice of the dopant. 
This would allow a suitable condition for the erase operation. The 
energy levels should not be excessively deep, since this would require a 
prohibitively high field for ejection of stored electrons. The energy 
levels should not be too shallow, since the stored charge might easily 
decay through thermal activation. The naturally occurring interfacial 


INTERFACIAL DOPANT FOR DDC CELL 1727 





$i09(100A) 


Fig. 2—Energy-band diagram of dopant-induced interfacial states. These states 
efficiently capture electrons arriving at the dual-dielectric interface following Fowler- 
Nordheim tunnelling through a relatively thick energy barrier. 


states clearly do not possess a comparable versatility as charge-storage 
sites. 

We find that there exists a range of dopant concentration which 
gives rise to optimum benefits. The upper limit of this range is dic- 
tated by the considerations that island formation by the dopant, either 
during its deposition or during the subsequent outer-layer deposition, 
is not really desirable since the dielectric property of the dual-dielectric 
layers is usually degraded. Furthermore, the initial deposition of the 
outer layers is strongly influenced by the presence of nonuniformity and 
may result in lossy layers. The upper limit also depends on the choice 
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of an outer layer and a dopant. Thus, Al.O3 with a nonoxidizing dopant 
such as Ir appears to limit the dopant concentration below 1 X 10%/ 
cm?. On the other hand, Si;N, with W may tolerate up to 5 X 10%/cm?. 
The lower limit is naturally dictated by the ease of write-erase opera- 
tions and appears to lie at around 1 X 10/cm?. 


1.4 Selection of dopants 


It is desirable that the dopants selected be physically confined at 
the dual-dielectric-layer interface. This means that both during dopant 
deposition and during subsequent outer-layer deposition, any migra- 
tion of the dopant is undesirable. The preferred method for depositing 
the dopant on the surface of the SiOz layer is thermal evaporation. 
Since good-quality outer layers are, at present, only obtainable through 
chemical vapor deposition (CVD) techniques at somewhat elevated 
temperatures of about 900°C, it is important that the deposited dopant 
does not diffuse into the SiO. layer and the outer layer at these tem- 
peratures. It is equally important that the deposited dopant does not 
evaporate away during the first phase of the outer layer growth. The 
vapor pressures of W, Pt, Ir, Ta, and Nb at 900°C are all lower than 
10-4 torr." On the other hand, Al has a vapor pressure of 1.5 X 10-5 
torr; thus, considerable amounts of Al could be lost. At any rate, 
control of the final dopant concentration may be more difficult in the 
case of Al. 

The vapor pressures of the dopant oxide may also be an important 
quantity to consider especially when the outer layer is an oxide layer. 
For example, tungsten oxide has a vaper pressure of approximately 
2 X 10-* torr." The diffusion constant of Pt into SiO» at 900°C is 
reported!’ to be 7.8 X 10—? cm?/sec. Compare this with the diffusion 
constant of Al into SiO, at 990°C of 8.2 X 10-™ cm?/sec" and that of 
P into SiOz at 900°C of 9 X 10-© cm?/sec.4 

It is evident from the above that indeed the suitable dopants avail- 
able are quite limited in number from the thermodynamic properties 
alone. The dopant-induced interfacial states should also possess suit- 
able properties. This might narrow the choice down further. We discuss | 
this aspect in more detail when we examine the electrical behavior of 
these cells, since it is in this aspect that the dopant-induced states 
come into strong play. 


i. CELL FABRICATION 


Our experimental vehicles have been capacitors and IGFETs built 
on an Si substrate about 5-ohm-cm n- or p-type, oriented (111) or 
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(100). The usual mechanical-chemical polish was given before the 
first thin SiO, layer growth was carried out in dilute O2 in He at about 
1100°C. Prior to this important thin-SiO.-growth step, the wafers were 
oxidized in 100 percent O: to a thickness of about 1000 A. This initial 
oxide was kept until the wafer was ready to receive the treatment for 
the thin oxide layer when this thick oxide layer was stripped in a 
buffered HF and thoroughly rinsed in deionized water. The thin-oxide- 
growth time was kept at 10 minutes. The partial pressure of O2 was 
varied to give rise to the desired first SiO. layer thickness of 50 A to 
150 A. The dielectric breakdown strengths of the thin-oxide layers 
were usually 7 to 8 X 10° V/cm defined as the dec field at which the 
current density reaches 5 X 10-8 A/cm’. 

The dopant evaporation was performed with an E-gun. A quartz 
oscillator monitor was located about 5 cm away from the source and 
the samples were located at various distances away from the source 
depending on the amount of dopant desired. A shutter was employed 
to initiate the dopant flux as well as to shut it off. An exposure time of 
2 minutes was generally used. The monitor had a capability of mea- 
suring 5 X 101*/cm? of deposited dopant to within 50-percent accuracy. 
A dopant concentration of 5 X 10/cm? could easily be obtained on 
the sample surface when the sample was located at about 50 cm away 
from the source. 

The dopant-covered samples were then ready for the outer-dielectric- 
layer deposition, which was either an Al,O; deposition using the well- 
known techniques of AICl;-CO.-H. CVD or an Si;N, deposition via 
SiH.-NH; CVD. A ratio of SiH, to NH; of about 0.01 was used to 
obtain Si;N, layers of least conductance.!® We feel that freedom in 
choosing CVD conditions for optimum results in outer layers, such as 
low conductance and/or high-dielectric constant, need not be sur- 
rendered in our case since the interfacial states configuration is more or 
less independent of the chemical nature of the outer layer when the 
interfacial dopants are employed. This is important because a true 
nonvolatility requires not only relatively thick SiOz layers but also 
good insulating outer layers. Outer-layer thickness in the 300-A to 
700-A range were investigated in this study. 

Some of the doped and undoped dual-dielectric layers were sub- 
jected to Rutherford back-scattering experiments” to ascertain dopant 
location and its concentration. The dopant concentration was within 
a factor of two of the estimated values from the indication on the 
monitor. The dopant location is judged to be at the dielectric interface, 
to the degree that this type of probe can be used to certify this. 
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The dual-dielectric layers were finally contacted with Al by evapora- 
tion from a devitrified carbon crucible through masks defining 15-mil- 
diameter circular areas for test capacitors. In addition, we have fabri- 
cated IGFETs with the dual-dielectric gate insulator for examination, 
more or less following a standard procedure. Again Al was used as the 
gate electrode. 


Ill. MEMORY CHARACTERISTICS 


IGFET threshold voltages are the most convenient parameter to 
assess memory behavior of the dual-dielectric charge-storage cells. 
Figure 3 shows write characteristics of a cell with interfacial dopant W 


Alg03 520A W_‘1.5x 10'5/em2 


Si07 70A  N-CHANNEL 


THRESHOLD VOLTAGE IN VOLTS 





0.1 1.0 10 102 103 104 
WRITE—PULSE WIDTH IN us 


Fig. 3—Shifts in threshold voltage in n-channel IGFETs with dual-dielectric gate 
insulation consisting of 70-A-thick SiO. and 520-A-thick Al.O; after application of 
positive-gate-voltage pulses. Solid lines apply to structures containing 1.5 & 10!°/cm? 
of W at dual-dielectric interface; dashed lines apply to structures with no interfacial 
dopant. The initial threshold voltages were at 1 volt. 
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of 1.5 X 10!°/cm? in concentration. The n-channel IGFET threshold 
voltages attained after positive-gate-voltage pulses (with respect to 
the Si substrate) were applied are plotted as a function of pulse width. 
The threshold voltage was initially at about 1 V. After each pulsing, 
the threshold voltage shifted to a larger positive value as one might 
expect since the pulsing resulted in electron injection from the Si sub- 
strate into the storage states. The SiO.-layer thickness was about 70 A 
and the Al,O;-outer-layer thickness was about 520A. It is evident 
that this structure allows shifts in threshold voltages of as much as 
7 V with only a 35-V, 100-ns pulse. A similar shift can be obtained with 
a 30-V pulse when the pulse width is increased to 100 us. For compari- 
son, plots are shown obtained with cells located on the same wafer 
identical to those discussed above, except that the W dopant was not 
present. These curves are shown in Fig. 3 in dashed lines. With 35 
volts, it is necessary to use a 10-ms pulse width to produce a similar 
shift in threshold voltages. 


Aln03 520A -W_1.5x 10!®/cm2 


SiO 70A  N-CHANNEL 


THRESHOLD VOLTAGE IN VOLTS 





0.1 1 10 102 103 104 
ERASE—PULSE WIDTH IN us 


Fig. 4—Shifts in threshold voltage, initially at 8 V, after application of negative- 
gate-voltage pulses to the IGFETs described in Fig. 1. 
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THRESHOLD VOLTAGE IN VOLTS 
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Fig. 5—Shifts in threshold voltages, initially at —8 V, after application of positive- 
gate-voltage pulses to the IGFETs described in Fig. 1. 


Even more spectacular differences between memory cells with and 
without the interfacial dopant are evident when one examines the erase 
characteristics shown in Fig. 4. The initial threshold voltage was at 8 
V prior to application of erase-voltage pulses. The usual sluggish erase 
behavior of an Al,03-SiO2-Si structure has clearly disappeared with 
similar structures with W as the interfacial dopant. An important 
additional feature of cells with interfacial dopant W is the fact that 
Al,0;-SiO--Si structures allow positive charging as well, which is usually 
not the case without the interfacial dopant. Figure 5 shows the erase 
characteristics after positive charging to the extent of —8 V in thresh- 
old voltage. 

Dual-dielectric charge-storage cells with thicker insulator layers show 
similar improvements when the interfacial dopants are used. Figure 6 
shows the erase characteristics of cells with a 100-A-thick SiO. layer 
and 570-A-thick Al.O; layer. Again curves pertaining to cells with and 
without interfacial dopant W are shown for comparison. As expected, 
the pulse voltages required to induce comparable threshold-voltage 
shifts are increased from the values needed with thinner structures 
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Alg03 570A = W- 1.5 x 10'5/cm2 
Si02 100A  P=CHANNEL 


THRESHOLD VOLTAGE IN VOLTS 





ERASE-—PULSE WIDTH IN us 


Fig. 6—Shifts in threshold voltage, initially at 8 V, after application of negative- 
gate-voltage pulses to p-channel IGFETs with dual-dielectric gate insulation con- 
sisting of 100-A-thick SiO. and 570-A-thick Al,0;. Solid lines apply to structures 
containing 1.5 X 10'5/cm? of W at dual-dielectric interface; dashed lines apply to 
structures with no interfacial dopant. 


(Fig. 4). However, dramatic improvements in erase characteristics in 
structures with the interfacial dopant are clearly evident. Here again, 
the initial threshold voltage of 8 V was used prior to the erase opera- 
tion. The write characteristic also shows similar improvements. 

One does not expect a strong dependence of the erase characteristics 
on the initial amount of charge in storage (although the self-induced 
field is linearly super-imposed on the externally applied field) because 
the induced field is very small compared to the externally applied field. 
This is shown in Fig. 7 where the threshold voltages after application 
of the erase pulse are plotted as a function of pulse height with the 
initial threshold voltages as a parameter. A pulse width of 200 us was 
used for this experiment. As can be seen, the erase curves quickly con- 
verge with each other. For a —30-V pulse, the resulting threshold 
voltages are identical for all practical purposes. This is a useful feature 
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THRESHOLD VOLTAGE IN VOLTS 





-50 -40 -30 -20 -10 0 
ERASE-PULSE HEIGHT IN VOLTS 
Fig. 7—Shifts in threshold voltages, initially at three different values, in IGFETs 
described in Fig. 4 after application of negative-gate-voltage pulses of 200-us duration 


and various pulse heights. Also shown are shifts in threshold voltages, initially at 8 V, 
after application of pulses of 10 ys, 1.0 us, and 0.1 ys in duration. 


in that the various initial threshold voltages may represent cells that 
have gone through various amounts of information storage time after 
the simultaneous write operation. Figure 7 also shows erase curves 
using shorter pulse widths for comparison. 

The interfacial-dopant concentration is expected to have a lower 
limit for which the beneficial aspect of the interfacial dopant is not so 
evident. This lower limit is established to be about 1 X 10'*/cm?, as 
can be seen from Fig. 8. Here the amount of interfacial dopant W is 
1.5 X 10%/cm?. When compared to the mate cell with no interfacial 
dopant, the erase curves show some effect, but it is not as pronounced 
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Alg03 400A W 1.5x 10'4/cm2 


Si0) 125A  P-CHANNEL 


THRESHOLD VOLTAGE IN VOLTS 





0.1 1 10 102 103 104 10° 
ERASE PULSE WIDTH IN us 


Fig. 8—Shifts in threshold voltage, initially at 8 V, after application of negative- 
gate-voltage pulses in p-channel IGFETs with dual-dielectric gate insulation of 
125-A-thick 8i02, 400- A-thick Al.0;. Solid lines apply to structure with 1.5 x 10%/cm? 
- W at dual-dielectric interface; dashed lines apply to structures with no interfacial 

opant. 


as in earlier comparisons with interfacial-dopant concentration at a 
larger value. 

It is well known that MNOS cells do not function well when the 
SiO. layer thickness is larger than 50 A. This is not the case when the 
interfacial dopant is introduced. Figure 9 shows write-erase charac- 
teristics of MNOS cells with a 500-A-thick Si;N, layer and a 100-A- 
thick $102 layer. Not only does this cell function well with 1.5  10%/ 
cm? of W interfacial dopant but it also shows negative charging. It is 
virtually impossible to charge MNOS cells negative with any SiOz 
layer thickness when no interfacial dopants are present. The mate cell 
with no interfacial dopant (see Fig. 9) shows some charging with posi- 
tive charges. However, it is not possible to erase this cell before a 
catastrophic breakdown sets in. In Fig. 9, an initial value of 8 V in 
threshold voltage is again used for erase curves and 1 V for the write 
curves. For cells with no interfacial dopant, the initial threshold 
voltage was near zero. 
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Siz3Nq 480A W 1.5x10'5 
SiO) 100A P-CHANNEL 


THRESHOLD VOLTAGE IN VOLTS 





10° 


PULSE WIDTH IN us 


Fig. 9—Shifts in threshold voltages—initially at 1 V after application of positive- 
gate-voltage pulses, and initially at 8 V after application of negative-gate-voltage 
pulses—in IGFETs with dual-dielectric gate insulation consisting of 100-A-thick 
SiO. and 480-A-thick Si;N, and containing 1.5 X 10!5/cm? of W at dual-dielectric 
interface. Dashed lines show shifts in threshold voltages, initially at 0 V, after applica- 
tion of negative-gate-voltage pulses in same IGFETs but with no interfacial dopant. 


We have also examined dual-dielectric charge-storage cells with inter- 
facial dopants other than W. Ta induces storage states with a behavior 
indicative of two distinct energy levels for electron trapping. The 
shallower level can be filled with electrons and emptied as well. How- 
ever, the second deeper level can only be filled, or at least it was im- 
possible to completely empty this level. This behavior was observed 
for both SisN, and Al.SO;3 outer layers. We do not understand the 
behavior of Ta-induced states well enough at present to warrant 
further discussion at this time. 

We have also examined Ir-induced interfacial states. Their behavior 
is fairly close to that of W-induced states. Exact comparison, however, 
requires a further study. It could be conjectured that the dopant- 
induced states may show in their behavior marked correlation with the 
relative oxygen affinity of the dopant employed. Ir and W more or less 
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lack the ability to steal oxygen from SiOz or Al.Os in our thermody- 
namic environment, while Ta is expected to be oxidized. 


IV. SUMMARY AND CONCLUSION 


It is shown in this paper that when suitable interfacial dopants such 
as W are introduced in a well-defined concentration range, the write- 
erase characteristics of dual-dielectric charge-storage cells are enor- 
mously improved. The upper limit of the dopant is dictated by its 
influence on the dielectric properties of the outer layer and is deter- 
mined to lie at about 10!°/cm? for Al,O3. Cells with 5 X 10!°/cm? inter- 
facial dopants showed marked increase in charge leakage. The lower 
limit is determined to lie at about 10%/cm? by comparison with cells 
with no interfacial dopants. 

The improvement in write-erase characteristics of these cells is of 
such a magnitude as to allow using relatively thick SiO, layers (greater 
than 50 A) in these cells, which is mandatory for long memory-reten- 
tion time (longer than 20 years at 100°C). (Detailed studies of retention 
time will be published separately.!8) This study indicates that cells 
with thinner outer layers (approximately 300 A of outer layer and 
approximately 60 A of SiO.) that would operate at gate-pulse voltages 
in the 25-V, l-us range, should be feasible. 
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Bias-Temperature-Stress Studies of Charge 
Retention in Dual-Dielectric, 
Charge-Storage Cells 


By K. K. THORNBER, D. KAHNG, and C. T. NEPPELL 
(Manuscript received February 19, 1974) 


We present a simple, relatively efficient method to predict the nonvolatility 
of dual-dielectric, charge-storage cells. Using this method, charge-retention 
times of several hundred years can be predicted unambiguously from ex- 
periments of several days’ duration made at elevated temperatures and with 
externally applied, accelerating fields. The method 1s first presented in the 
context of a simple, physically reasonable model of the bias and temperature 
dependence of the characteristic relaxation time of the device. It is then 
used to analyze a particular device structure. At 80°C, we predict that this 
device can maintain a flatband-voltage shift in excess of 4 volts for approzxi- 
mately 500 years. Our analysis suggests that this method can be applied 
to a variety of dual-dielectric, charge-storage cells to predict their non- 
volatilities. 


I. INTRODUCTION 


The retention time! of charge in dual-dielectric, charge-storage 
cells!” (DDC’s) is of central importance in evaluating and comparing 
these devices. An ideal device would hold its charge at a constant level 
indefinitely. While, of course, such an ideal device is physically im- 
possible, nonideal, charge-storage memory cells have been fabricated 
that are expected to have minimum charge-retention times on the 
order of tens of years. And, as such devices are improved, even longer 
retention times can be expected. The question that naturally arises is 
how to evaluate such devices in a few days to predict their charge- 
retention times. 

The primary purpose of this paper is to describe a method of bias- 
temperature stressing in which the decay of the stored charge is greatly 
enhanced and, as a result, from which one can predict the charge- 
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retention time of the device under normal operating conditions (zero 
bias and room, or somewhat higher, temperature). We stress at the 
outset, moreover, that our primary concern is charge retentivity and 
not device reliability. Bias-temperature-stress methods are often used 
to good advantage in reliability studies to artificially reduce the 
lifetime of the device. Here, we are using similar methods to arti- 
ficially reduce the retention time of charge stored on the device. 
For reliability studies, one makes use of models in which the device 
lifetime is governed by temperature and applied bias. Owing to the 
complexity of the aging, however, these models are often quite tenta- 
tive. Here we use a well-defined model in which the rate of decay of the 
stored charge is governed by temperature and bias. As in reliability 
studies, experimental results are used to determine the parameters of 
the model. While the bias dependence of the rate of decay of stored 
charge has been noted previously,’ the work we report on here is the 
first in which bias and temperature stressing have been used simultane- 
ously to predict charge retention under normal operating conditions. 
We feel obligated, therefore, to elaborate our approach and findings in 
some detail. 

At the heart of our method of prediction are the observations (2) 
that there exists an “envelope” function that sets an upper bound on 
the total stored charge that can be present in the device at any given 
time, independent of initial conditions, (iz) that this envelope function 
is determined primarily by the characteristic relaxation of the device, 
and (271) that this relaxation is a strong function of temperature and 
electric field. By focusing attention on the envelope function, which 
properly indicates the long-time decay of the stored charge, we avoid 
incorrect predictions of decay time based on extrapolations of the 
initial portion of the charge-decay curve. (If the charge decay is 
plotted versus log time, such predictions are overoptimistic; if it is 
plotted versus time, such predictions are overpessimistic.) From mea- 
sured values of the characteristic relaxation obtained at elevated tem- 
peratures and under applied bias, relaxation times of interest at room 
temperature and under zero bias can be predicted. Owing to the non- 
‘ linearities inherent in the charge decay, these zero-bias, room-tem- 
perature relaxation times are functions of the initial stored charge; 
the larger the initial stored charge, the smaller the relaxation time. 
For example, for one version of our device we find that, for an initial 
charge corresponding to a flatband voltage of 10 V, a relaxation time 
of 3-104 years is predicted; for 7 V, we predict 6-104 years. Using our 
results, predictions of relaxation times can also be made for devices 


1742 THE BELL SYSTEM TECHNICAL JOURNAL, NOVEMBER 1974 


operated under bias and at other than room temperature. For ex- 
ample, at 80°C under zero bias we predict for 10 V a relaxation time 
of 100 years and for 7 V 300 years. Linear extrapolations of the initial 
portion (¢ < rrelax) of the charge-decay curve would result in misleading 
estimates of charge-retention times on the order of 10° years. 

It is important at the outset to stress that we are concerned in this 
paper with devices” with ‘thick’ oxide layers; that is, with oxide 
layers between the semiconductor and the charge-storage sites suffi- 
ciently thick that the rate of decay of charge via back-tunneling 
through the oxide layer is small compared to the rate of decay of 
charge through the insulator layer between the storage sites and the 
gate. While one can accelerate the back-tunneling charge decay of thin- 
oxide devices’ by applying an electric field, we believe that, unless one 
has an exceptionally good means of characterizing the specific tunneling 
processes of interest, such information is of little direct value in esti- 
mating the decay in the absence of such field, i.e., under normal oper- 
ating conditions. The applied voltage, of course, can accelerate or 
decelerate decay through the oxide layer or the insulator layer, depend- 
ing upon its polarity. If electrons are stored, a positive gate voltage 
enhances the decay in thick-oxide devices because the decay is through 
the insulator, whereas the same voltage reduces the decay in thin-oxide 
devices’ where the primary decay is back through the oxide. One 
reason for studying thick-oxide devices is to determine the limits on 
the retention time of charge-storage devices imposed by charge decay 
through the insulator layer. Such limits are of considerable interest, 
especially for devices whose back-tunneling decay is sufficiently low 
that, based on this decay mechanism alone, one might exaggerate the 
device’s charge-storage capabilities. 

In this paper we first discuss in general terms the physical processes 
that lead to the decay of the stored charge. A relatively thick oxide 
layer is used so that the primary discharging current is through the 
insulator layer. This current through the insulator is found to be char- 
acterizable as a thermally activated flow of charge via defects with a 
very low, but strongly field-dependent, mobility. We then discuss how 
this strong temperature and electric-field dependence of the decay 
current can be used to controllably accelerate the discharging of the 
DDC. A simple, physically reasonable, empirical model is introduced 
to explain the experimentally measured decay of the flatband-voltage 
shift in time with temperature and externally applied bias as param- 
eters. The mathematical expression for the decay current is sufficiently 
simple that all quantities of interest, particularly device relaxation 
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times, can be calculated analytically. From the high-temperature, 
high-accelerating-field results, it is possible to predict low-temperature, 
zero-bias behavior. We can thus unambiguously determine nonvola- 
tility on the order of tens or hundreds of years on the basis of experi- 
ments performed in several days or weeks. 

Before proceeding, it must be emphasized that the method of bias- 
temperature stressing that we develop here is not limited to devices 
which behave according to our simple empirical model. Indeed, as we 
point out in Appendix A, the central ideas at the heart of the method 
are valid, within certain limits, to a variety of models: the method is, 
within certain bounds, insensitive to the details of the actual decay 
process. We discuss the physical processes that we believe are opera- 
tive in our devices first, however, because this will permit the reader 
to familiarize himself with the actual devices to which our method is 
then applied. Although our results indicate the general features of a 
particular, detailed decay model, we emphasize that we are not pri- 
marily concerned with establishing the existence of such a model. 


ll. PHYSICAL PROCESSES 


The structure and fabrication of the memory devices studied here 
have been described in detail elsewhere."* For our purposes here, it 
suffices to note that a typical device consists of the following layers 
(see Fig. 1): a metallic contact (taken fixed at zero voltage), n- (or p-) 
type silicon, 70 to 200A of SiO» (referred to as the oxide layer, or 
simply the oxide), a set of suitable dopant-induced storage states — 
(referred to as the storage states), 400 to 500A of Al.O; or SisN, 
(referred to as the insulating layer, or simply the insulator), and a 
metallic contact (referred to as the gate). Inclusion of the specific 
states, which are the interfacial dopant-induced states, makes it 
possible to store either electrons or holes, whichever is desired, as well 
as to provide very well-defined storage sites for the elementary charges. 
The oxide and the insulator are sufficiently thick that tunneling from 
one side of either to the other side can be neglected. 

The device can be charged either negative or positive as follows. 
Negative charge can be stored by driving the gate to a high positive 
voltage (250 V) for a short period of time (® 100 us). Some electrons 
in the Si tunnel into the conduction band of the oxide, traverse the 
oxide, and then are trapped in the storage states or pass completely 
through the device to the gate electrode. Positive charge is stored by 
driving the gate negative to force electrons out of the storage states, 
through the oxide, and into the semiconductor. 
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Fig. 1—Schematic of the structure of the MIOS memory device under five differ- 
ent, applied-bias conditions. The location of the interfacial, dopant-induced, charge- 
storage states is indicated. They are assumed to be charged negative. 


Our experimental results indicate that these multilayered memory 
devices discharge as follows. (Refer to Fig. 1 in which a device is shown 
under five different biases. Note, in particular, Fig. la, in which the 
semiconductor is inverted near the SiO.) The (negative) stored charge 
produces large electric fields in both the oxide and the insulator. In the 
first few hundred milliseconds, some fraction of the charge is lost 
through thermionic emission or tunneling into the conduction bands of 
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the oxide and/or the insulator. For most of the stored charge, however, 
the trapping levels are sufficiently deep that these processes are very 
improbable. Charge can tunnel into traps in the insulator, and through 
such imperfections a discharging current can pass into the conduction 
band of the insulator and flow to the gate. Some charge can also pass 
completely through the insulator in the forbidden band by means of 
these traps. In addition, in a similar manner, a discharging current can 
be present in the oxide and flow to the Si. 

The size of the discharging current is expected to be strongly field 
dependent.!~® In addition, we note that increasing the (positive) gate 
voltage enhances the electric field in the insulator and decreases this 
field in the oxide. The manyfold increase in the decay current which 
occurs when a positive voltage is applied to the gate thus implies that 
the primary discharging current is passing through the insulator. In- 
versely, when a negative voltage is applied to the gate, a decrease in 
the decay current is observed. In this case, the change in the applied 
fields is such as to increase the current through the oxide and decrease 
the current through the insulator. A detailed study of the reverse gate- 
bias decay was not undertaken because of the long time intervals 
required even at high temperatures. For the present, it is sufficient to 
note that the total decay current increases with positive voltage and 
decreases with negative voltage applied to the gate. 

The physical details of the transport of the charge within the insu- 
lator can be narrowed down as follows. The strong temperature en- 
hancement of the decay of the charge indicates that thermal activation 
rather than tunneling plays the dominant role in enabling charge to be 
transferred toward the gate. The question of whether the charge passes 
through the insulator to the gate entirely within the forbidden band, 
or partially in the conduction band, is more difficult. The relatively 
low thermal activation energies observed (0.6 eV) suggest that the 
rate limiting portion of the transport is not associated with the con- 
duction band. If it were, much higher decay currents would be ex- 
pected. We infer, therefore, that the current is controlled by the traps 
in the bulk of the insulator. 

For charge transport from one trap to the next, one might expect 
the current to be proportional to exp [(#, + qg&a/2)/kT ], according 
to Poole’s law.“ (Here EZ, is an ionization energy, & is the electric field 
in the insulator, and a is the intertrap spacing.) However, Poole’s 
law, or its extension and modification by Frenkel!> 6 and others!’—® 
include only the thermal excitation of carriers and do not include 
velocity-versus-field effects connected with the transport from one site 
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to the next.2° Current density is proportional to gnv, where n is the 
number of charges per unit volume and » is the carrier velocity. One 
expects 7 to be thermally activated; however, in an insulator, v can 
be a rapidly varying function of applied fields,?!-?? whether electron 
transport is in the conduction band or in a defect ‘‘band,” as seems to 
be the case here. Thus, whereas for pure Frenkel-Poole behavior the 
excitation of electrons is the rate-limiting, field- and temperature- 
modulated process, for our devices both n and v are so modulated. 

Our experimental results indicate that the discharging current is 
proportional to the empirical expression 


&(7’) exp (—E./kT) exp [+ 6/8.(T)], (1) 


where &,(7') is a very slowly varying, decreasing function of the tem- 
perature 7’. To the accuracy of the experiments, the activation energy 
E, is independent of the applied field. The decrease of &,(7) with 
temperature could be due to an increased overlap between polarized 
electronic states because of increased lattice vibrations. The electric 
field dependence can in general be expected to be more complicated 
than that given in eq. (1). Fortunately, eq. (1) suffices for our purposes, 
partially because the decay curves are relatively independent of the 
detailed dependence of the discharging current on electric field. See 
Appendix A for a discussion of more complicated field dependences. 

One final point should be made regarding the decay current. As part 
of the assumption that the rate-limiting process controlling the charge 
decay is the trap-controlled current within the bulk of the insulator, 
we have assumed that the number of carriers in transit is independent 
of time. In other words, as a trap in close proximity to the storage 
states loses its charge to a trap somewhat closer to the gate, the emptied 
trap is quickly refilled with a charge from a storage state. Thus, the 
average time for a charge to pass from a storage state to an empty trap 
is much less than the average time for a charge to pass from one trap 
to the next. This results from the high density of storage states, and 
makes the effective number of carriers available for transit independent 
of the magnitude of the stored charge and hence independent of time. 
This is reasonable throughout most of the decay owing to the large 
number of stored electrons. Near the end of the decay, however, when 
the amount of stored charge is much less than its initial value, the rate- 
limiting process may become the transfer of charge from the storage 
states into the insulator. We ignore this effect, since, by the time it 
becomes important, the fundamental relaxation time of the device will 
be well determined. 
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From the foregoing discussion, it is evident that we can artificially 
enhance the decay greatly by increasing the temperature and the 
electric field in the insulator. In so doing, however, we must be very 
careful that we do not excessively enhance charge-transport mecha- 
nisms that under normal operating conditions would play no sig- 
nificant role. For example, by applying too large a potential at the 
gate, the field in the oxide will be enhanced to such an extent that the 
storage layer will be slowly charged by electrons tunneling from the Si 
into the conduction band of the oxide. Excessive temperatures, on the 
other hand, may enhance decay into the insulator’s conduction band, 
exaggerating this mode of charge transport. These effects are often 
easily detected experimentally, and they give rise to upper limits on 
the increases in temperature and electric field possible to enhance 
charge decay for measurement purposes. 

Although the decay of charge from the storage states can be en- 
hanced by increasing either the temperature or the insulator electric 
field, the enhancement from either is insufficient to bring the decay 
times into the range of days. However, this can be achieved by com- 
bining higher temperature with higher fields. Of course, we again must 
be careful to avoid introducing decay processes that under normal con- 
ditions would be insignificant. (An example of this would be Schottky 
emission.) Nonetheless, we find that we can obtain the entire charge 
decay curve within a few days (at most) without introducing extrane- 
ous decay mechanisms. From these curves, we can predict the normal 
decay curve and hence the charge-retention time of the memory ele- 
ment. In this way we avoid having to determine this retention time by 
extrapolating the initial portion of the time dependence of the charge 
decay. In the next section we see how the nonlinear dependence of the 
rate of decay on the quantity of stored charge leads to characteristic, 
charge-decay curves from which one can predict the nonvolatility of 
DDC’s. 

In those cases where certain alternative conduction mechanisms are 
activated, this method may or may not work. In some devices, the 
charge decay is via Fowler-Nordheim tunneling from the storage states 
into the conduction band of the oxide or the insulator, or it is via direct 
tunneling from these states into the semiconductor. Neither of these 
decay currents will be affected by changes in the temperature; how- 
ever, both such currents will be strongly modulated by applied bias. 
In the case of Fowler-Nordheim tunneling, our method is directly 
applicable. For direct tunneling, however, in which the primary decay 
is via $i-SiO, interfacial states, we must be very careful as we increase 


1748 = =THE BELL SYSTEM TECHNICAL JOURNAL, NOVEMBER 1974 


the bias not to enhance the tunneling into semiconductor states in the 
valence or conduction band, which under normal conditions would 
play no important role in the discharging of the DDC. Perhaps one can 
modulate the decay by altering the thickness through which the elec- 
trons tunnel, either by physically squeezing the sample or by con- 
structing samples of varying thicknesses. 


lil. EMPIRICAL MODEL AND MEASUREMENT TECHNIQUE 


Let us assume that we have some (negative) charge stored in the 
interfacial, dopant-induced states. The quantity of charge stored in 
these states is proportional to the measurable flatband voltage V,;,. Also 
proportional to Vy is the field in the insulator under zero-bias con- 
ditions. In the presence of a finite bias voltage Vz, an additional field 
proportional to the bias voltage will be linearly superimposed on the 
zero-bias field (with a different constant of proportionality). Thus, 
using expression (1) and expressing fields in terms of voltages, we 
write for the time rate of change of Vys, the flatband voltage, 


dV 4 V.(T) Vin + bVe 
“dt (TP) exp | Vi(L) i @) 
where V, and 7 are experimentally determined functions of temperature 
only and 6 is a relative constant of proportionality (see Appendix B). 
Equation (2) presumes decay via the insulator; for decay through the 
oxide, one would have the factor (Vj, — Vs) in place of the factor 
(Vy. + bV.). This empirically satisfactory expression is reasonable 
physically, as explained in Section II. It also permits a simple, analytic 
treatment of all significant features of the decay. Should the flatband 
voltage decay according to a relation other than (2), one can still 
employ bias-temperature stressing. This is indicated in Appendix A. 

Before solving eq. (2), we should clarify two points. (7) We do not 
require eq. (2) to be valid for é < 1 minute. It may be valid there but, 
being interested in the long-time decay of the charge, all we need to do 
is to integrate eq. (2) from some time #;, on the order of minutes after 
charging, when the experiments are begun. (77) Since dV ;,/dé does not 
vanish as (V;, + bVs) — 0, eq. (2) is clearly not valid ast — «. As 
remarked in Section II, this difficulty results from assuming that there 
are always a large number of stored charges, whereas in fact near the 
end of the decay this number becomes small. Again, this difficulty need 
not concern us, since we do not have to consider the leakage after the 
bulk of the stored charge has decayed away. (See Appendix A for a 
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more general treatment.) With these limitations in mind, we proceed 
to solve eq. (2). 

The solution to eq. (2) follows at once after a separation of variables 
and an integration from (f1, Vi) to (t, Vs»). The resulting V,.(¢) for the 
decay of the flatband voltage as a function of time ¢ at temperature T 
and applied bias V; is given by 


Va) = V.(T) log. { (exp L—Vi/V.(1) J + CG — &)/7(T)] 
-exp LbV2/V.(T)])“}. (8) 


For a single value of 7’, this function is sketched in Fig. 2 as a function 
of logiy (t/t1) for several V, and for hypothetical, illustrative values of 
V,, 7, and b. The purpose of plotting as a function of logio (t/t1) is to 
call attention to the actual long-time behavior of V(t). For ti < t 
«K 7(T) exp [— (Vi + 0V>2)/V5(T) ], Veo(t) is relatively flat. However, 
for t > 7(T) exp [— (V1 + 6V3)/V;(T) ], Vio @) is given approximately 
by V.(é) where, if t; is any convenient scale factor, 


Ve(t) = V.(T) log. [r(T)/ti] — bVe — V.(T) log. (t/hx). (A) 


The function V.(é) is the previously mentioned envelope function for 
each T and V3. It is shown dashed in Fig. 2. (Note log. x = logio 2/ 
logic e.) Although #; may be assigned the same value as t, the two 
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Fig. 2—Schematic illustrating a possible decay of the flatband voltage V;»(¢) as a 
function of logio (¢/t1:) in a hypothetical device for four different applied biases Vz 
at a fixed temperature 7. For simplicity, we set b = 1. Also shown are the envelope 
functions V.(t) associated with each V4. 
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must be distinguished. Our ¢; refers to an initial condition, while V.(¢) 
is independent of the initial conditions, ¢; and Vi, and satisfies Vy,(t) 
<V.(t) (see Appendix C) and 


V(t) SV), t>>7expl[—(Vi + bV,)/V;]. (5) 


When V(t) is plotted versus logy (¢/ti), V; is the slope and 7 is the 
intercept [V.(r) = 0] for zero bias (Vs = 0). From the experimental 
V,.(¢) curves obtained for several V, at a given T, it is possible to 
readily determine 7(T), V.(7’), and b, the three parameters in eq. (4). 
From the envelope function V.(¢) drawn tangent to V;,(t) as shown in 
Fig. 2, we obtain V,(7') from the slope of V(t), b from the difference 
between V(t) (for fixed ¢) at different Vz, and 7(7’) from the value fz 
of t where V,(¢) = 0: 

log. [+(T)/hs] = log. ( #) + yerpy 6) 

a “\ iy V.(T) 

In Fig. 2, ¢2 for Vz = 8 V is shown (é = 5-10+? minutes). It is a test 
of the form of eq. (2) that V.(T), b, and 7(7’) be independent of V» 
to within experimental accuracy. That this is indeed the case is dis- 
cussed in Section IV. We now repeat the above for other temperatures 
in the range 150°C S$ T S 300°C. We find that both 7(T) and V,(T) 
appear to be “thermally activated”’: | 


7(T) = ro exp (Ha/kT), V;(T) = Vs. exp (E;/kT). (7) 


Since the slope V, may arise from a combination of physical processes, 
identifying it with a thermally activated process may be somewhat 
misleading. (The parameter b is a weakly decreasing function of T. 
Since we are interested only in predicting zero bias (V; = 0) behavior, 
we need not concern ourselves further with extrapolating b to room 
temperature. In other words, we can for each temperature use the b 
measured at that temperature to extrapolate to zero bias. Once we 
know the zero-bias result, we can extrapolate to the desired tempera- 
ture. Since b enters only as bV, and V,j is zero, this latter extrapolation 
can be performed without knowledge of b.) The quantities r,, Ha, Veo, 
and #, are obtained from the usual (1/7) plots. We assume, and 
indeed from our discussion in Section II it is not unrealistic to expect, 
that we may extrapolate 7 and V, to room temperature using (7). We 
must now relate these quantities to the characteristic relaxation time 
Trelaxy the charge retention time, of the storage device. 

In Fig. 3 we define rretax(7+) graphically, again for hypothetical 
device parameters. It is simply the ‘‘roll-off”’ time of Vy5(t): Ve(trelax) 
= Vi, where Vi = Vy(t1). [As shown in Appendix C, Vi & Vys(0) 
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FLATBAND VOLTAGE Vip IN VOLTS 





104 105 106 107 108 
TIME t IN MINUTES 


Fig. 3—Schematic illustrating a possible decay of the flatband voltage V(t) in a 
hypothetical device for room temperature (7 = T) and zero applied bias (V_ = 0) 
for two different quantities of stored charge, Vi and V,. The size of the characteristic 
relaxation time 7relax is indicated for both cases. Note that Vi — V{ = Vy — V? 
= 0.69 V;,. 


for t1 K Tretax- | Solving (4) for 7retax at room temperature 7’, for V, = 0, 
we obtain 


Trelax = T(7'.) exp [—Vi/V;(T.) |. (8) 


Indicative of the nonlinearities inherent in the charge-decay process, 
Trelax is a function of the initial stored charge, and hence the initial 
flatband voltage Vi, increasing as V; decreases. (Compare Treiax and 
Trax Corresponding to different amounts of initial stored charge Vi 
and Vj, respectively, in Fig. 3.) Again we emphasize that rrelax cannot 
be obtained from (linearly) extrapolating V(¢) based on its behavior 
for t K Trelax. SINCE Trelax 18 on the order of 104 years at room tempera- 
ture for devices reported on here, it is doubtful whether room-tem- 
perature, zero-bias experiments alone will be able to predict Tretax. 

Although rrelax provides a measure of the charge retention time of 
the device, we must, in addition, know how far V;, has decayed below 
Vi by the time 7yeiax before full significance can be attached to this 
definition of characteristic decay time. Returning to eq. (3) and letting 
ti = 0, Vi = V2, Vs = 0, we obtain for Vy,(¢) under zero bias 





Vale Villon: | ever: (1 aie 8 y"]: (9) 


T relax 
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In arriving at eq. (9), we have used eq. (8). Thus, 


V so (Tretax) = V3 <a Vs log. 2 
Vi — 0.69V,. (10) 


I 


We thus obtain the very important result that, at the roll-off time 
Trelax, V ss has dropped 0.69V, volts below its initial value. This amount 
is independent of V{, the initial flatband voltage. Of course, Vs (Tretax) 
must be several volts positive so that eq. (2) will still be valid. In addi- 
tion, 0.69V, thus defines the decay that must be taken into account 
when including this device in circuits. In determining V, from the 
envelope of a decay curve, one must be careful to convert from log1o 
to log.: 

_ Volts) — Volts) 


Vs logio (t2/t1) 


x 0.434, 


where 0.434 = logio e. 

Another characteristic decay time is the time at which the flatband 
voltage drops below a certain margin voltage V,. We refer to this time 
as the margin time, Tmargin, Or Simply 7m. It follows at once from eq. (9) 
that 7m as a function of V,, and Vi is given by 


tm = T(T){exp [—Vnn/V.(T)] — expl[—-Vi/V.(T)}} = (A) 


as a function of 7 under zero bias conditions. A plot of tmargin iS pre- 
sented in the next section. 


IV. EXPERIMENTAL RESULTS 


Bias-temperature-enhanced, charge-relaxation experiments were 
carried out on the recently devised dual-dielectric, charge-storage 
cells.% Aluminum oxide (Al.0O3, « = 9) was used as the insulator. The 
thickness d, of the SiO. (« = 4) was 125 A, and that of the Al.Os, di, 
550 A. Thus, b = C./C; [see eq. (25)] is predicted to be given by 

_C, _ esio.d di _ 

Ge eae oo 
Experimental values for b ranging from 0.8 to 1.2 were obtained over 
the temperature range of 150°C ST S 300°C. The reason for this 
discrepancy is not known. Some possible explanations are discussed in 
Appendix D. The area of the devices was about 1-10° wm?, and the 
doping of the n-type silicon was about 6-10!° per cm’. Tungsten of a 
density of about 2 to 3-10" per cm? was used to produce the interfacial 
storage sites. 


CHARGE RETENTION IN DDC’S = 1753 


Table | 


Sample* CC) 1/T - 10° Oe ean) Gans b as 
H15-4 150 2.36 15 2.44 2.6- 104 1.2" 3.75-107 
Hist | ooo | aur | (19 | SS | 2830 | ti | “aso 
te | ..280-|||) 461 fy ae cae ae io | 1038 | 17-108 
BEL | oo fom | (8 | EB | EE} | os | oa 


*'H15 is the wafer label and 1 or 4 is the chip label. 
Estimated value. 


Table I presents a summary of the experimental results which we 
have analyzed most carefully. Four representative charge decay curves 
are plotted in Fig. 4. The envelope function V,(t) associated with 
each curve is shown as well. From the V.{t) we obtain the data given 
in Table I. One might do better by attempting to fit the actual experi- 
mental curves with eq. (3), but we have found V.(t) adequate for our 
purposes. 

In studying Fig. 4 one may wonder why the slope of the decay curves 
seem to diminish at higher temperatures. In fact, as is clear from eqs. 


FLATBAND VOLTAGE Vip IN VOLTS 
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Fig. 4—Experimental results under four different conditions: J’ = 150°C, 


V, = 15 V; 250°C, 7 V; 250°C, 10 V; 300°C, 10 V. In each case, the element was 
charged by setting V, = 50 V for 100 us. 
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(2) and (7), |dV;./dé| increases as 7’ increases, as one would expect. 
One must remember that we are plotting Vys(t) versus logio (t/t1). 
Thus, the enhanced decay has shifted V;,(t) to lower t, reducing the 
apparent decay of Vy, when plotted on a logio ¢ scale. The authors 
must admit to having expended some effort themselves getting used to 
plotting Vs, versus log ¢ rather than log V;, versus t, as is common in 
many decay problems. 

By plotting the values of 7 and V, given in Table I as functions of 
1/T, the activation energies H, = 0.61 eV for 7 and #, = 0.065 eV for 
V, can be determined [see Fig. 5 and eq. (7) ]. From Fig. 5 we can 
also calculate 7, and V,,[eq. (7) |: 7. = 2.0 minutes and V,, = 0.41 V. 
Using the E,, ro, E's, Vs. so determined, we use eq. (7) to extrapolate 
7(T.) and V,(T.), where 7, is room temperature (290°K, 17°C). We 
find r(T,) = 7.7 X 10° minutes and V,(7.) = 5.5 V. With these 
values, we can predict the room-temperature, zero-bias relaxation 
times using eq. (8). 

The activation energy E, of 0.61 eV is comparable to activation 
energies ranging from 0.4 to 1.2 eV reported for Frenkel-Poole con- 
duction through oxide layers.**-?8 We conclude, therefore, that #1, may 
be interpreted as being associated with an activation process. Although 
E, is relatively small, being on the order of several kT’s, its existence 
does result in an appreciable variation of V, with TJ (as observed experi- 
mentally). We do not have a simple independent quantitative explana- 
tion of its magnitude and caution the reader against interpreting EH’, as 
being associated with a simple, thermally activated process. 

As discussed in Section III, the charge-retention time of our dual- 
dielectric, charge-storage cell is well-characterized by the device 
“roll-off,” or relaxation time, as defined in Fig. 3. Owing to the non- 
linear dependence inherent in the charge decay, Tretax depends upon 
the “initially” stored charge, which we denote in terms of flatband 
voltages by Vi = Vy(ti). In Fig. 6 we plot rretax as a function of Vi 
for several temperatures of interest. For room temperature and a Vi 
of 10 V, rretax is 3-104 years; for 7 V, 6-104 years. That Tye1ax is a func- 
tion of stored charge should be carefully noted. Lastly, we note that, 
for room temperature at Trelax, Vsy has dropped (0.69) (5.5) = 3.8 V 
below V;. At 80°C the drop is only 2.4 V owing to the reduced value, 
of V, (=3.4 V at 80°C). However, now Treiax for 10 V is only 100 years, 
and for 7 V, 300 years. 

In Fig. 7 we plot rmargin a8 a function of V, for zero bias and for the 
same temperature used in Fig. 6. As is obvious physically, the larger 
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Fig. 5—Experimentally determined parameters 7 and V; plotted versus 1/T to 
determine the activation energies H#, and E;. 


the initial stored charge (initial flatband voltage V1), the longer the 
time required for the flatband voltage to drop to the margin voltage 
Vm, here taken to be 4 V. The horizontal lines give the upper limit or 
largest possible 7 margin for the given temperature. For Vi between 4 and 
5 V, tm goes rapidly to zero, as is clear from Fig. 3. For V; less than 
4 V, tm 1s obviously meaningless. 
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Fig. 6—The predicted roll-off or device relaxation time for zero-applied bias 
(V, = 0) and temperatures of —40°C, 17°C, 80°C, and 100°C plotted as a function 
of ‘initial’ flatband voltage Vi = V(¢1), where ¢; = 1 minute. 


V. CONCLUSION 


In this paper we have discussed a method of predicting the retention 
time of charge in dual-dielectric, charge-storage cells. The method is 
based on the realization that an envelope function V.(¢) exists, that 
this function is determined primarily by the characteristic relaxation 
of the device, and that this relaxation is a strong function of tempera- 
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Fig. 7—The predicted margin time for zero-applied bias (V, = 0) and tempera- 
tures of —40°C, 17°C, 80°C, and 100°C plotted as a function of “‘initial’’ flatband 
voltage V; = V(t), where t: = 1 minute. The horizontal line above each curve is 
the upper limit for tmargin a8 Vi is taken larger and larger. 


ture and applied bias. By choosing an appropriate function to represent 
the dependence of the discharging current on the applied bias and 
stored charge, it is possible to discuss all the interesting features of the 
time-dependent charge decay in terms of simple mathematical expres- 
sions. We were thereby able to predict, on the basis of experimental 
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data obtained under bias-temperature stressing of representative 
devices, the time dependence of the shift in the flatband voltage at 
zero bias and operational temperatures (room temperature to 80°C). 
In particular, we can characterize the nonvolatility of the DDC’s in 
terms of a roll-off or relaxation time 7, and a margin time rm, both 
dependent on the level of initial charging. Although these times are on 
the order of 104 years, they were determined from experimental data 
taken over a period of only a few weeks. This was possible because our 
method of bias-temperature stressing greatly enhances the dominant 
discharging processes, speeding up the charge decay without intro- 
ducing significant decay from otherwise insignificant decay modes. 
Because of this, we feel that our method will greatly facilitate the 
realistic prediction of the nonvolatility of DDC’s. 
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APPENDIX A 


Equations (1) and (2) in the text are admittedly oversimplified, 
despite the ease they render in interpreting the data. Nonetheless, the 
characteristic decay V(t) discussed in the text is approximately ob- 
tained in a variety of cases, e.g., tunneling or Frenkel-Poole, even when 
it is known that the rate of decay is proportional to more complicated 
voltage dependences than exp (V/V,). It is the purpose of this appendix 
to outline why this is so. For simplicity, we assume that conditions are 
such that only one decay mode is important. 

Usually we can write the decay of the stored charge, or, equivalently, 
that of the flatband voltage V, in the following form, 


dV ; 
= -LY + 6%) — fO], (12) 
where, in turn, 
{[V’) = Ae”, VV’ =V+5OYV,, (13) 


A being a dimensional constant and g(V’) a function of temperature 
and applied bias as well as of various powers of V’ (and possibly of 
log. V’). The point of writing f(V’) in the form given in (18) is to focus 
primary attention on the exponential nature of the functional depen- 
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dence of f(V’) on V. Thus, f(V’) can be approximated over a wider 
range of V by a Taylor expansion of g(V’) in (13) than by a Taylor 
expansion of f(V’) directly. 

Suppose at ti << Trelax, V = Vi. Then, assuming f(0) < f(V’) for 
V of interest, we can readily integrate (12) if we approximate g(V’) 
by 9(V1) + y(V — Vi), where y may be chosen in various ways 
(V;= V;+bV;). For example, we may desire upper and lower 
bounds for V (é), in which case we can use upper and lower bounds for 
g(V’). In some cases, y = g'(V1) or y = [g(Vi) — g(V2) 1/(Vi — V2) 
give such bounds. In all cases, we obtain the characteristic decay of 
which eq. (8) is one example. 

A somewhat more appealing approach that avoids approximating 
quantities in exponents is the following.”®. Again, assume f(0) < f(V’) 
to write the integral of (12) in the form 


v2 dV Ve es (V"). 
t —t = — TTF SS 14 
Re ayy a 7’) ne 
According to the intermediate-value ones this becomes 
ine ee ii [raven "(V’) 
So AG Va ati 
1 
= —9(V)) — p-0(V) 
apple ou, (15) 


where V2 < Vs < Vi. (Physically, we must have g’(V’) > 0.) Thus, 
given V; and 4, for each V- of interest, t2 follows from (15) to within 
the uncertainty of choosing g’(V3). (This will normally be, at most, a 
factor of 2 for Ve1/2V;.) The envelope function V.(¢) follows at 
once from (15) by setting ¢, and exp [—g(Vj) ] to zero. 


t = ¢-9(VD/Ag' (V2). (16) 


If we note that V.(t) = V.(t) + 0Vs:, differentiating (16) with respect 
to V» at fixed ¢ and assuming that g’(V3) is only weakly dependent on 
V», it follows at once that 


dV, 


Vor 





Under these conditions, the vertical separation between envelope 
functions V.(t) corresponding to two different applied biases V; and 
V3 (at the same temperature) will be b(V} — V3), even when these 
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envelope functions are not straight lines, but are given in general 
by (16). 
Equation (15) may be rewritten in the characteristic form as 


g(Vs) — g(Vi) = —log. [1 + Aer ?P9'(V3)(t2 — ts) ], (17) 
where we may write 
g(V2) — g(Vi) 9g’ (Va) (V2 — V2). (18) 


Equation (18) follows from the mean-value theorem (V2 < Vs < Vj). 
Thus, while it is clear that the exact solution will in general not be 
precisely given by (17) and (18), such a solution will still show the 
same general features as the above for physically reasonable g(V). It 
should be noted that, since g’(V3) and g’(V4) actually represent aver- 
aged quantities, and since g(V;) is simply an initial condition, the 
results given in (17) and (18) are rather insensitive to the detailed 
dependence of g(V’) on V. 
One final point should be noted. For V sufficiently small and V; = 0, 

we may write 

OVE ap 

a f’(O)V. (19) 
The solution to (19) is the common exponential decay {exp (—t/t.), 
to = [f’(0) ]}“} now with voltage V an exponential rather than a 
logarithmic function of time t. For ¢>> rretax, clearly V(é) will be of 
this form. But this is well beyond the region in time of the primary 
decay of the charge, and it is this primary decay governed approxi- 
mately by (17) to (18), which is of crucial importance for evaluating 
charge storage devices. 


APPENDIX B 
B.1 Decay through the insulator 


Section II indicated that the electric field 6 in the insulator drives the 
charge-decay current [see eq. (1) ]. This electric field arises from the 
stored charge and the applied bias. The purpose of this appendix is to 
express & in terms of two measurable voltages, the flatband voltage, 
which measures the stored charge, and the applied-bias voltage. 

Let C. be the capacitance of the oxide in series with the semicon- 
ductor, and let C; be the capacitance of the insulator. We assume C’, 
(as well as C,) is independent of voltage. (We thus ignore the bias 
dependence of the semiconductor capacitance.) Simple capacitative 
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division implies that, if Vz is applied to the gate and there is no stored 
charge, then the voltage V, dropped across C; is given by 
_y 1/C; 

°1/C; + 1/C, 
On the other hand, if a charge Q is stored and V, = 0, then the voltage 
V, across C; is given by 


Vs (20) 


V. = Q(Ci + C.)7. (21) 
See Fig. 1b. But Q is related to the flatband voltage V,, by the relation 
Q = CV ps. (22) 
See Fig. 1d. Thus, 
an 1/C, 
Ve VT e te i 


The total voltage drop Vr across C;, which produces 6, will in general 
be a superposition of the two; thus, 


_ 3s _s 1/C, 
Vr =V.= YG, FG, (Vip + BV), (24) 
where 
b = C,/C,. (25) 


In passing from eq. (1) to eq. (2) in the text, we have let V;(T) absorb 
the prefactor (24). Thus, V,(7’) depends both on the physical processes 
of the decay and on the geometry of the device. 

It is important to note that b can be greater than as well as less than 
unity. In principle, b can be computed from knowledge of C, and C;. 
It is more easily obtained from the measured decay curves, a more 
reliable method. Its slight temperature variation is somewhat of a 
puzzle. If at higher temperatures additional capacitances C; and C, 
arose in series with C; and C,, respectively, then b would become b’ 


given by 
»_ Cof1+C/Cr\. 
® =e) ee 


One expects the semiconductor portion of the device C, to be more 
susceptible to degradation than the insulator side C;. If C; = © and 
C, < , then b’ < b, as observed. For lower temperatures, b is close 
to its value predicted by (25). Fortunately, b is determined within 
experimental error at a given temperature to be independent of V4. 
Thus, at each temperature we can predict zero-bias behavior. We can 
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then predict room temperature, zero-bias behavior from higher tem- 
perature, and zero-bias behavior without knowledge of b. This is be- 
cause only the factor OV, enters eq. (2), and V, = 0 for zero bias. 


B.2 Decay through the oxide 


If the primary discharging current is through the oxide (rather than 
through the insulator, as assumed throughout this paper), then the 
electric field that produces this current must be calculated from the 
voltage across the oxide. Using the notation introduced above, we 
note that the portion V; of the bias voltage V, dropped across the 
oxide is 

1/C, 
Ly C; + 1/ C, 
On the other hand, it is clear from Fig. 1b that the contribution V, of 


the voltage resulting from the stored charge is just Va given by (28). 
Thus, the total voltage drop V7 across C, is given by 


1/6, 
VG, +e. (V — V3). (28) 


Thus, unlike the case of Vr (24), no geometrical factor analogous to 
b enters. 


Vi=Vs (27) 


Vr=Vi,-V,= 


APPENDIX C 


In this appendix, we derive two minor results used in the text. 


() To show that Vy,(t) < V.(¢), we need only compare the argument 
of the log, in eq. (3) with 4, = 0 and V; = V? with that of eq. (4) when 
written 


=f 
V. (t) = V; log. ( 0 + : eb Vo! v) : (29) 
From (3), we have 
-1 
V y(t) = J), log. | (er + ~ eovave) i (30) 


As the argument of the log, in (30) is less than that in (29), it follows 
at once that 
Vyo(t) < V(t). (31) 


[ Additional decay mechanisms important for very short times (<1 
minute), which are not included in (80), will, of course, reduce Vy, 
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even further. | It may be noted that V; and t; may be chosen so that 
Vy.(€) given in (3) exceeds V,(¢t) and Vy» approaches V, from above. 
Such unphysical behavior would result if one mistakenly chose a value 
of t1 too large; that is, if, after completing the charging of the device 
(t = 0), one recorded V; at ¢, using a clock that read a time sufficiently 
larger than 0 at ¢ = 0. 

(22) To show that Vi} V?¢ when t; K Tretax, we calculate V, from 
eq. (8), in which we set Vi = V?, 4 = 0, and V, = 0. Then it follows 
that 








-1 
Vi = V, log, | en (a + = ) |: (32) 
2 relax 
If now ti: < Tretax (the usual case), then 
ty 
WiwvVv- Vs. (33) 
T relax 


Since V, < V@ and #4: 10 rye1ax at most (for room temperature and 
zero bias 10~7 is typical), we may use Vi & V{ to a very good degree 
of approximation. The actual, initial, flatband voltage may be larger 
than V{, owing to other decay mechanisms which may be important 
for ¢ < 1 minute. It is Vi [or V? obtained from V, using eq. (3) ], how- 
ever, with which we are concerned. 


APPENDIX D 
The “b-Factor’ 


The disagreement between the predicted b-factor (25) of 2.0 and 
the much smaller measured b-factor (Table I) of 0.8 to 1.2, which we 
shall call b’, calls into question our assumption that the electric field 
in the insulator is spatially constant. This follows because, if the field 
is spatially constant (and if the current per carrier depends only on 
the electric field, as is physically reasonable), then the measured and 
predicted b-factors, b’ and b, would agree. Since, in fact, they do not 
agree, we conclude that the field is not spatially constant, a result, 
perhaps, of charge stored in the insulator. It is the purpose of this 
appendix to investigate some consequences of charge stored in the 
insulator (in addition to charge stored at the oxide-insulator interface) 
on the decay of the flatband voltage Vy». 

Let us begin by asking what information our flatband-voltage mea- 
surements tell us in light of the possibility of having charge storage in 
the insulator. Referring to our empirical result (2), we note that, ata 
fixed temperature, the rate of decay of Vy,(é) is a function of 
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[Vyo(t) + b’'V,]. Thus, whatever electric field governs the decay of 
V y(t), it too must depend on Vy, and V, in the form [Vys(é) + b’V, |. 

The next step is to express the various fields and voltages of interest 
in terms of the stored charge. Let Q,(t) be the charge stored at the 
oxide-insulator interface at time t, p;(x, ¢) the density of charge stored 
in the insulator, ¢; the dielectric constant of the insulator, and e, that 
of the oxide. It follows (see Fig. 1d) that the flatband voltage is 
given by 


Vis ant oe f — (ade (34) 
= V,(t) + Van(t) (35) 
= f a Cee (36) 
=Vr(t), (37) 
where 
pr = pit (Q;/A)6(z). (38) 


Here A is the cross-sectional area of the device, d; is the thickness of 
the insulator, 6(x) is the usual delta function, V, is the contribution of 
Q; to Vys, and V,, (the m referring to moment of charge) is the con- 
tribution of the charge stored in the insulator to Vy. (Throughout our 
discussion, we are assuming that no charge is stored in the oxide.) A 
straightforward but more involved calculation leads to the electric 
field in the insulator, 0 < x < d;, under general charging and applied- 
bias conditions: 


= [Oa PE 9 -5)| (39) 


e.. ce Vo a , P(x’, t) x 
=a+y lors f dig! Pri 2 (+55) 


= [ag ED 9 (1 -=)|. (40) 


If we integrate (39) or (40) over the insulator, we obtain the average 
field times d;, the voltage drop across the insulator Va(t) given by 


Vat) = (1 + 6) LV) + Vint) + 0V5] 
(1 + b)"LVs(t) + bVe]. (41) 
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Using these results, we can discuss what physical effects can and cannot 
lead to the observed behavior (b’ < 6). 

Owing to the presence of charge in the insulator, the electric field 
at the oxide-insulator interface will be reduced from its value in the 
absence of such charge. If H(Ot, t) controls the decay, perhaps this 
reduction can lead to a reduced b-factor. Unfortunately, this effect 
would result in (b’ > b) rather than (b’ < b). To see this, we note 
from (89) that 


E(Ot, t) = (1 + 6) d;"[V.(é) + dV, — OV nC) ] 


= ata | 5 70 — UVa +075): (42) 


If this expression is to be a function of [V;s,(t) + b’Vz], it is necessary 
that 


Vyo(t) = (0'/b)V.(t) — b'V a (2). (43) 
To satisfy both (35) and (48), it is necessary to have 
—_ 1/b — 1/0’ 
Vnlt) = “Tb +1 Vyo(t) (44) 
and 
ayy 
V;(t) = Tbh Vyo(t). (45) 


However, if b’ < b, then 1/b < 1/0’, and the coefficient of Vy,(¢) in 
(44) is negative. This is rather disturbing since, as a result of the initial 
charging and subsequent decay of the charge through the insulator, 
one would expect V,,(é), as defined in (35), to be positive. In addition, 
(44) implies that, as Vy, decreases during charge decay, Vm» must 
increase algebraically. This can occur only if charge stored at the 
oxide-insulator interface becomes trapped in the insulator neutralizing 
the charge trapped there. This effect would be enhanced if more charge 
were neutralized near the interface (2 = 0) than in the middle of the 
insulator. Although this may be the source of (b’ < b), the origin seems 
physically unreasonable of the bulk trapped charge of sign opposite 
the stored charge which, although not neutralized by the charging 
current, is neutralized by the decay current. We conclude that, if 
(b’ < b), then the field at the interface probably does not control the 
charge decay. 

We note in passing that, since the voltage drop Va(é) across the 
insulator (41) is already a function of [Vs.(t) + bV2], it also cannot 
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be the average field in the insulator which is controlling the decay. 
This hardly requires further elaboration. 

One effect that charge stored in the insulator has on the decay is to 
enhance the electric field near gate (x = d;) relative to the field near 
the interface (x = 0). This enhancement may be sufficient to remove 
charge stored in the region x, < x < d; of the insulator during the 
initial portion of the decay, leaving significant stored charge only in 
the region 0 < x < 2, of the insulator and at the interface. If, during 
the remainder of the decay, the decay rate is governed by the electric 
field at x., we then expect to observe (b’ < 6b). Let us see how this 
comes about. 

If p:(x, t) = 0 for x > 2z,, then the electric field at x = x, is, accord- 
ing to (39), given by 


E(x, t) = an eB V.(t) + b/V, 
fs alc ae Ay 2 as ov 40 || , (46) 


where 


QQ = Af ar'os(a', 0 (47) 


=A i) “ dx'p;(x’, t), (47a) 
0 


the size of the stored charge in the insulator. Alternatively, using (40) 
for the field, we obtain 


a bE ay bY Q® 1» 
Ben) = qr pa | vv +5 a+H +o], as) 
where Q(t) is the total stored charge defined by 


Q@) = Q:() + Q.(2). (49) 


For E(z,, t) to be a function of (Vs, + 6’Vz), it is necessary that 
[using (37) ] 


Vr = -0'Vr th +0) 20 (50) 
or that 
_ 14+1/b Qt) 
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Now (b’ < 6) implies that C:Vr(t) < Q(é), which means in turn that 
some charge js stored in the insulator, a consistent result. 

Having seen how a measured b-factor (6’) can arise which is less than 
the computed b-factor (b), we must inquire as to whether the effect is 
sufficient to explain the measured results. For simplicity, let us assume 
that p; is uniform for 0 < « < x,. Then it follows that 


ae e % a (1 — 2,/2d,). (52) 


It then follows from (48), (51), and (52) that 


Q/C: _ 1+ Q/Q. 1+ to 
Vr 1+ @/Q)1—a,/2d) T+1/b 


If we consider the case at hand where b = 2.0 and b’ is essentially 1, 
then (53) implies that x, must be at least 0.50 X d;, that is, that the 
stored charge in the insulator must extend at least 50 percent of the 
distance from the interface to the gate. If Q./Q; is 1.0, then x, = dj. 
Thus, Q./Q; must be at least 1.0 for, if it were smaller, then x, > di, 
which is not possible. Therefore, if the charge stored in the insulator 
is uniform between x = 0 and x = a,, then it is necessary to under- 
stand the observed 6b’ that 0.50 < x,/d; <1 and 1.0 < Q:/Q;, the 
latter implying that more charge must be stored in the bulk at the 
insulator than at the interface. 

In the preceding paragraph, we have assumed that the stored charge 
was uniformly distributed in the region 0 < x < 2,. In fact, we expect 
the charge to be more dense near the interface (x = 0) where the field 
is lowest than near x = x,. To satisfy (51), this would require larger 
values of Q./Q; and of x, than for the uniform case. We have also 
assumed that it is the electric field at x, that controls the decay. It is 
possible that H(z, t) for x < x, in fact performs this function. This 
would further increase the values of Q:/Q, and x, required to achieve 
(b’ < b). One’s lattitude here is rather limited, however, for, as we 
have seen, if it is H'(0*, t) which controls the decay, then Q./Q, becomes 
negative. We offer the above, therefore, as possibilities only. 

Another source of the (0’ < b) effect is that the charge may be 
extraction-limited, that is, controlled by the field at = d;. If we put 
2 = d; in eqs. (46) and (47a), then we again obtain (51) relating the 
total stored charge to its first moment. We noted above that we could 
understand the measured b-factor for x, = d; and a uniform, stored, 
insulator charge if Q:/Q; = 1.0, a reasonable but perhaps somewhat 





(53) 
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large value. However, if the current is extraction-limited, then we 
expect that the charge would be more dense near x = d; than in the 
bulk of the insulator. As a hypothetical example, suppose that the 
stored, insulator charge is uniform for 71 < x < d; and zero for 0 < x 
< 2,1. Then from (37) we obtain 


4% ,Q1/,_ a), 
Vr = Cas (2 3) (54) 
It then follows from (49) and (51) that 

Q/Ci _ 1+ Q/Q: _1+1/0' 


Vr 1+ @/Q)0 —a/d)/2 1+ 1/6 (55) 


This relation provides a much greater possibility for obtaining (b’ < b) 
than (53). For example, if Q; > Q,, then 


b’ S$ (1+ 2/6) < 6/2. (56) 
Or, if 71 } d;, then 
b’ = bL(1 + 6)Q./Q. +147, (57) 


from which b’ = 1 would follow for b = 2.0 if Q:/Q, = 0.338, a very 
reasonable value. For 21 < d;, somewhat larger Q,/Q; would be re- 
quired to satisfy (55). However, in most cases Q:/Q, would be smaller 
than that in the previous example (53), in which the stored charge 
was assumed near the oxide-insulator interface. We conclude that, 
while we have indicated the possibility of how (b’ < b) can come about, 
further work is required to really pin down and calculate the measured 
b-factor b’. 
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This paper describes the noise performance of input amplifiers for 
optical pulse-code-modulation repeaters. The noise ts treated in terms of 
an effective noise generator in parallel with the photocurrent induced in the 
detector and the effective noise, in turn, is related to error performance. The 
analysis applies to both conventional and integrating front ends. Both 
field effect and bipolar transistor amplifiers are treated. For the latter, an 
optimum bias current that minimizes the effect of thermal notse is derived. 
Finally, predicted and measured performance are compared for silicon 
field-effect transistor input amplifiers at 6.8 Mb/s and 50 Mb/s, and for 
bipolar transistor and GaAs field-effect transistor input amplifiers at 
274 Mb/s. 


Il. INTRODUCTION 


The factors which limit the performance of an optical receiver are 
optical quantum noise, leakage noise of the detector, thermal noise 
introduced by the detector load resistor, and various forms of noise 
introduced by the input amplifier. If an avalanche detector is used, 
the leakage noise has two components—one which is gain independent 
and the other which is gain dependent. In addition, the gain process 
introduces a signal-dependent noise. In high-speed pulse-code-modu- 
lation (PCM) receivers, input-amplifier noise plays an important 
role in the determination of system performance. If leakage current 
is negligible, it can be shown! that without avalanche gain the 
required signal power to achieve a given error probability varies as 
the square root of the thermal noise power and with optimum avalanche 
gain with the sixth root of the thermal noise power. Furthermore, the 
optimal avalanche gain varies as the cube root of the thermal noise 
power.’ 


* These results assume an excess noise coefficient of 0.5. 
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In applications employing a detector with a capacitive impedance 
such as nuclear particle counters’ and television camera input ampli- 
fiers, an approach has been employed in which the input circuit 
integrates the signal and the signal is equalized after amplification. 
Personick? has analyzed the performance of PCM repeaters with 
integrating front ends and Goell has experimentally verified some of 
his results at 6.8 Mb/s.® 

The noise performance of an amplifier is often stated in terms of 
noise figure. This approach is attractive when thermal noise can be 
reduced by matching the source impedance to the optimum noise 
impedance of the amplifier. For optical receivers and other systems 
with a capacitive source the noise of the source is often a negligible 
portion of the total thermal noise, and the noise contributed by trans- 
former losses would more than negate the advantage gained by trans- 
formation. For this case, noise figure, which is inversely proportional 
to source noise power, is a poor figure of merit and it is better to treat 
the noise directly. 

The approach chosen here is to describe the noise of the amplifier 
by an equivalent input noise current generator which produces the 
same noise at the output as the internal sources of the amplifier. The 
intermediate step of finding a short-circuit input noise current gener- 
ator and open-circuit input noise voltage generator is dispensed with 
here because it does not contribute to physical understanding when the 
noise figure concept is not employed. 

If the linear portion of the receiver is modeled by a cascade of an 
amplifier, an equalizer to give a flat frequency response in the band of 
interest, and a filter to set the noise bandwidth and control the pulse 
response, then the equivalent current is a particularly convenient 
way to express the noise. This approach is conceptually simple, the 
effect of equalization is implicitly included, and filtering has the same 
effect on the equivalent noise as on the signal. 

The error performance of a PCM receiver can be related to the ratio 
of the peak signal to rms noise ratio at the regenerator. It will be shown 
that this ratio is equal to the ratio of the average received signal power 
to an effective noise current which can be readily derived from the 
equivalent input noise current. 

In the previously mentioned work by Personick the amplifier noise 
was modeled by the series voltage and shunt current noise generator. 
Results were given only for field effect transistor (FET) input ampli- 
fiers. In this paper the noise of both field effect and bipolar transistor 
amplifiers is analyzed. The effect of spreading resistance and load 
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EQUALIZER FILTER 





Fig. 1—Typical input circuit of an optical repeater. 


noise is included and the application of the method to cases in which 
the noise of more than one stage is significant is described. The ap- 
proach is used to compare the performance of silicon bipolar and field 
effect transistors and GaAs field effect transistors as a function of bit 
rate. Theoretical predictions are compared with measured results 
for silicon FET input amplifiers at 6.8 Mb/s and 50 Mb/s® and for 
bipolar transistor and GaAs FET input amplifiers at 274 Mb/s. 


Il. CALCULATION OF EQUIVALENT NOISE 


A model of the initial stages of a typical optical receiver is shown 
in Fig. 1. The resistor R, is provided to return the detector bias current 
to its source. Equalization is provided to compensate for the frequency 
dependence of the amplifier gain, and filtering is provided to limit the 
noise bandwidth and shape the signal. For the cases to be described 
here, it is convenient to think in terms of an equalizer to give a flat 
frequency response followed by a filter which describes the frequency 
dependence of the transfer characteristic of the receiver, although in 
practice they can be combined. In some applications, such as where 
matched filtering is to be employed, another approach might be used. 

The detector circuit without avalanche gain can be modeled as 
shown in Fig. 2. Here 7, is the induced photocurrent, C; the detector 





Fig. 2—Detector circuit. 


OPTICAL PCM RECEIVERS 1773 


lin 


Zi { A(w)i Zo 


Fig. 3—Equivalent noise circuit. 


junction capacitance, R.g the diode spreading resistance and S, the 
noise current spectral density associated with it, C. the capacitance 
of the interconnection circuit, S, the noise current spectral density of 
the de return resistor, and S; the spectral density of the gain-inde- 
pendent diode-leakage noise current.* With avalanche gain, 7, is the 
photocurrent multiplied by the mean gain, 9g. 

The performance of any linear amplifier can be described by the 
equivalent circuit of Fig. 3. Here, the impedance Z; is the parallel 
combination of the input impedance of the circuit and the output 
impedance of the preceding circuit, and A(w) is the current amplifi- 
cation of the stage. S.q is the frequency-dependent equivalent noise 
current spectral density of the noise current generator which when 
applied to the input gives the same output noise spectral density as the 
internal noise generators. 

An equivalent circuit which applies to both bipolar and field-effect 
transistors is shown in Fig. 4. Spreading resistance has been ignored 
for the present, but will be analyzed later as a separate stage. 

The equivalent noise current spectral density is given by 


Seq = Seqi + Seq2; (1) 


where S_qi is the contribution to Seq of Si, the emitter (source) noise 
current spectral density, and Seg2 is the contribution to Seq of Se, the 
collector (drain) noise current spectral density. The calculation of 
Seq: and Seg for common emitter (source), collector (drain), and base 
(gate) stages is described in the appendix. A comparison of amplifier 
configuration is also given. 

At low frequencies, that is, when the transit time of carriers through 
the device is short compared to the period of the signal, for a field- 
effect transistor at pinch-off 


* The effect of the gain-dependent leakage noise has been described by Personick.? 
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Fig. 4—Equivalent circuit for both bipolar and field effect transistors. 


and 
Se = AkTT gm, (3) 


where T is the temperature, gm is the transconductance, J, is the gate 
leakage current, c, is the gate capacitance, and k is Boltzmann’s 
constant.’ For junction field-effect transistors, T = ? and 6 varies 
from yo to 74.8 With currently available devices, drain noise has 
been found to predominate with an untuned input circuit. 

At very low frequencies account must also be taken at 1/f noise. 
The 1/f noise region for junction field-effect transistors extends to a 
few hundred kilohertz, while for metal-oxide semiconductor field 
effect transistors it can extend to above 10 MHz. For GaAsFETs 
intervalley scattering adds an additional component of noise. For this 
case T = 1.1. 

For bipolar transistors with short base transit time, the input and 


output noise generator spectral densities are given by 
Si _ 2els, (4) 
So = 2el., . (5) 


where J; is the base current and J, is the collector current.’ The above 
applies for the case of short base transit time and the base current 
much greater than the reverse base saturation current. 


lll. EFFECTIVE NOISE IN PCM RECEIVERS 


For PCM systems, performance is measured in terms of required 
power to achieve a specified error probability and the error probability 
is determined from the ratio of the peak signal to the rms noise at the 
regenerator input. In this section the relationship between the peak- 
signal-to-rms-noise ratio, the average received signal, and the equiv- 
alent noise will be described. 


OPTICAL PCM RECEIVERS 1775 


We shall assume that the detected photocurrent, an undistorted 
version of the transmitted signal, is given by 


where 


2 Tibilavhp(t — kT»), 

i it~ ET ar 1: 

b, = 0, 1 depending on the signal statistics, T, is the bit interval, and 
I,vT'y is the detected charge if a single pulse were received. It is assumed 
that the transmitted pulses are distinct. 

The output signal is given by 


> bihmho(t — kT), 
k=—00 


where h,(t) has been normalized to have unit peak amplitude and hy», 
is the peak output signal current. It will be assumed that intersymbol 
interference is negligible, that is, that at the sampling instant, t,, 

hots — kT.) = 0 


k= +1, +2,---. 
transfer function is given by 


eae hmH (w) 


An inconsequential time displacement has been ignored. Then the 
TrlavH p(w) ’ 
is given by 


(6) 
where H,(w) and H,(w) are the Fourier transforms of h,(t) and h,(t), 


respectively. The mean-square noise after amplification and filtering 
to-rms-noise ratio 


=e 1 -) 
= = i. | A () [7Seq (w) deo. 


where 


sgh 


‘lF 
ails 


i) 
< 





Substituting the mean-square noise into eq. (6) gives the peak-signal- 








’ 7 
e 2 

on <= 2rTs Jo mG] Bead. 
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(8) 
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Thus the quantity VE, is an effective noise current which relates the 
thermal noise to the average induced photocurrent. 

For the case where most of the noise originates in the first amplifier 
stage we will see that under conditions which apply in several cases 
of interest the equivalent noise can be expanded in the Taylor series 


Seq(w) = 2 a;(w) Ij. 


7 
It is then convenient to put the effective noise in the form 


ten = fo DO a; (2afe)49;, (9) 


j 


ee 2 
5; = ym f 


and f, = 1/Ty. The normalization ratio was chosen so that the 9’s 
depend only on the shape of the pulse relative to the bit interval, not 
on the bit interval. 

It can be shown? that for binary PCM with an avalanche detector 
the power required to achieve a specified error probability, P., is 
given by 


where 
2 
H.(w) wide (10) 


H p(w) 











_ Ql Oh. 25 
P pr Qn ig T Fe left | » (11) 
where 
= detector quantum efficiency, 


electronic charge, 


® 
I 


optical frequency, 


detector quantum efficiency, 


DB xr 
i 


mean detector gain, 


I 


mean-squared detector gain, 


SJ] any} 


and 
G = v2 erfce!(2P.). 


The function erfe is the error function complement. For a gainless 
detector when thermal noise predominates, the first term in the bracket 
of eq. (11) can be neglected. 
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A value of g exists! which minimizes p. The ratio of g?/g? can be 
approximated* by g*. Then the value of g which minimizes p is given by 
Ovi2, \Me@H " 

on = (can) 


and the required power at optimum gain is given by 


= hyQ*2/(@+) ( ref, \W Cet) 1 <z-2! Q242) 
Pope = BO (3) (142) : (13) 


IV. APPLICATIONS 


In this section, the effective noise for both field effect and bipolar 
transistors will be described. Emphasis will be placed on conditions 
encountered in receivers operating in the 1-Mb/s to 1-Gb/s range. 
It will be assumed that most of the noise originates in or before 
the first amplifier stage. Corrections for the load, subsequent stage, 
and spreading resistance noises will be included. 

Both detectors and transistors have distributed junction capacitance 
and series resistance followed by a case capacitance. However, at the 
frequencies to be covered here the equivalent circuit can be approxi- 
mated by capacitors across the detector source and transistor base 
(gate) separated by the total spreading resistance of both devices. 
The spreading resistance can be considered as a separate stage with a 
transfer function A(w) given by 


1 


Be) =a OR, 


and noise current spectral density 
S, = 4kTw°CGR,, 
where Cq is capacitance across the detector source. 
The gain-independent leakage current Jz can be accounted for by a 


noise spectral density 
Deg = Qela 


in parallel with the detector current generator. Noise of spectral 
density S, is contributed by the return resistor, R,, which will be as- 
sumed to be on the amplifier side of R,. This assumption is valid as 
long as the contribution of R, to the noise is small. 


* Silicon detectors with x between 0.3 and 0.5 are available today commercially. 
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The total equivalent noise current spectral density can be written as 


Seat + S; + Seq2 


eg a S; + Seq + [A () |? 


= 4kT.°CiR, + 2eLa + 





1 
4kT fea A re Bi] 
[A(w)[?& gm Re  — |gm%Zql? J’ 


where 
] 


Ca 
C, = é. + Coa, 
and R, is the amplifier load resistance. Thus, 


1 + wCoR’ ner (1 + w°C2R?) 


Seq & 4AkT | orCaR, +5 eA 28 
1 
T'9m + Dp 2 
+ oy {(# — u?C,Cak.) 


2 
+ (C, + CaF ) | + 2ela, (14) 
where 
C; = Ca + Co + Coa. 
In practice R, can be made extremely large so 


Rs 
R < 1 


and, from eq. (9), 


1 
ae) 
R, EN fxd ° 


| ela ee ; 
ae (2zfs) Jo + ee. Ci 


+ (27f.)4 ale Fi 2 
ao) ae eae + 9g . (15) 


From this relation, it is clear that, from the standpoint of thermal 
noise, #, should be as large as possible, even if signal integration takes 
place. 





= 2elafoTo + ne 
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For junction field-effect transistors such as the 2N8823, the gate 
noise is negligible. Assuming R, is large and the detector leakage 
current is negligible gives the simple relation 


Den a SkT f, (2rfeC 1)? Se 
ef" 9}. 


a7. (16) 


Thus, for this case which is typical for junction FETs operating at a 
bit rate below 25 Mb/s, the quantity 


Gm 
Ci 


is a figure of merit for the required power without avalanche gain. 
Under the above assumption, the effective noise current increases 
with the 3 power of the bit rate. 

We now turn our attention to the bipolar transistor. Unlike the FET, 
for a bipolar transistor the optimum bias is dependent on frequency. 
For a bipolar transistor, 


Seq = 2elp, 
2el ¢ + a 
Seq2 = eRe ’ 


where Z;, the impedance across the base, is given by 


1 Ca 
Za i + i0(% TT oR) 
and 
kT 
k= oie 


Assuming that the de current gain equals the ac current gain, the 
equivalent noise current spectral density is 
eas + Seq2 
| A (w) |? 
Ly (1+ 5) 4 AAD 204 + w?| ARP CORR, + 2clsC3R? 
CRr 


zs (“4 + we) (cS, ) 4+ C3R? + 2C3R, =| 


BR 1 
2eln , AKT \ we n2pe ae 
Sis ro) (= B ae aR, \CICHRY( els 


Seq = Ss + Sa + 
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Then 


ey = afe| {(1 + a + ah, a 1| 54+ (2rfs)? 


+ cts +(3 + an) Le(Gr,) + cea +2088. (ZZ) |} + 


+ er (-+ 22.) ae (5) |] 


At values of base bias which will shortly be shown to lead to minimum 
effective noise at frequencies up to about 1 GHz, 


Rs 
R, <K a 


(Qrfo)?CACiRII4 K CPR», 


2kTCiR; 
e 





and then 


= 2kT 
i 2pe| {(1 a: 5 )i 13+ la} 


2kT CRs I, , 2kT kT 
+ (2rfr)? | aoe ae (3 =r inary ) Ct (= ae )| 32: (17) 


The base capacitance is the sum of the diffusion, depletion, and 
stray capacitances. The diffusion and depletion capacitances increase 
with base bias and the stray capacitance is bias independent. We will 
approximate C, by 





Ci. Cu + Cals. 


The optimum bias current is then found by differentiating 72; how- 
ever, a third-order equation results. Assuming most of the noise 
originates from the first stage (Rz ©), the optimum bias current is 


So 
CoN TE BS. (18) 


ae 








- ‘ os 


The second radical is close to unity under most practical conditions 
(e.g., up to 1 Gb/s with 6 = 100, Cal, = 1 pF) and thus J,, is set by 
the zero bias capacitance. Substituting J,, into 72,, again assuming all 
of the noise originates in the transistor, gives 


tn = = Lett o ie {1 at “(1 “+ _ eh , (19) 
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where 


Te = SES I (145): (20) 


The term CJ; is the diffusion capacitance at optimum bias. 

We can now go back and determine correction terms for the noise 
contributed by the series and load resistance terms at Jy. The ratio of 
the effective series noise current to 724, is given by 


tents Ca [9 
an UnfCaRe Ga] 5B (21) 


ett o 


and the ratio of the effective load noise current to i27, by 


<3 
VeftL wy 4kT 
Ono ~ BeR ile. ve 


The input time constant at optimum bias is given approximately by 


Ce (AFB) Ts. 
RC: as C. gg Or (23) 


Since J, > 92 and B > 1 for practical conditions, the time constant 
will greatly exceed the bit interval. Finally, the current gain at opti- 
mum bias is given by 


CS ee 
1 tog me wl; (1 + B)So 
PoC, V Se 
Thus the transfer function is a function of w/w, and, for a fixed pulse 


shape and £, the frequency dependence of the current gain, A(w), 
scales with bit rate and its magnitude is independent of bit rate. 


V. EXPERIMENTAL RESULTS 


The preceding analysis will now be compared with experimental 
results that have been obtained by Goell at 6.8 Mb/s and 274 Mb/s 
and by Runge at 50 Mb/s. 

The 6.3- and 50-Mb/s repeaters employed 50-percent-duty-cycle 
rectangular return-to-zero (RZ) optical pulses. For 274 Mb/s, the 
optical pulses were nonreturn-to-zero (NRZ). For the 6.38-Mb/s 
repeater the baseband pulses were RZ and for the other two repeaters 
the baseband signal was NRZ. In all of the cases the baseband pulses 
were close to the shape to be expected with a raised-cosine spectrum, 
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Table |— Signal formats and pulse shape integrals 


Transmitted Equalized 


Pulse Pulse So 52 
Case I 50 percent RZ 0.30 0.13 
uty cycle 
Case IT 50 percent NRZ 0.40 0.036 
duty cycle 
Case III NRZ NRZ 0.55 0.085 


but with some intersymbol interference at the highest bit rate. For 
the purpose of this paper we shall assume a raised-cosine spectrum. 
Since the error rate is a rapid function of the signal level, the effect 
of intersymbol interference on error rate can be accounted for by a 
reduction of received average power equal to the reduction of the 
worst-case pulse-pattern signal to threshold level. The integrals 9, 
and J. are summarized for each of the signaling formats in Table I. 

For the 6.38-Mb/s and 50-Mb/s repeaters, SiFET front ends were 
employed. In the former case, gm was 0.006 mho and the total circuit 
capacitance 8 pF and for the latter these parameters were 0.006 mho 
and 6.7 pF, respectively. For the 274-Mb/s repeater two front ends 
were tested; the first with a common-emitter followed by a common- 
collector stage, the second with a common-source GaAsFET input 
stage followed by an emitter follower. In both cases the input capaci- 
tance was 4 pF. The bipolar transistor used was an FMT4000. 

Figure 5 shows the base current vs collector current for the 
FMT4000. The current gain for this device is 160 for base currents 
to well below 1 pA. 

Both the calculated and measured effective noise current is shown 
in Fig. 6 for the FMT4000. The experimental values were inferred 
from the power which gave the signal-to-noise ratio experimentally 
determined to give a 10-° error probability. Experimental curves are 
given for the input transistor and for the complete receiver. At very 
low bias currents the noise originating after the first stage becomes 
significant. Near optimum bias, which was calculated to be 5.5 wA, 
the required signal is increased by about 0.5 dB by noise originating 
after the first stage. 

The measured optimum for the first stage is about 4 wA indicating 
the base noise is somewhat higher than assumed in the theory. The 
error-rate performance was found to be optimum at about 7 wA with 
about 23 dB more power than predicted from the calculated first-stage 
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Fig. 5—I, vs Ip for an FMT4000 bipolar transistor with 10 V collector bias. 


and measured subsequent-stage noise. The puise response for pseudo- 
random data indicated that about 15 to 2 dB of this discrepancy were 
due to intersymbol interference. 

The measured and calculated effective noise currents for all of these 
bit rates are summarized in Table II. For the GaAsFET case an 
FMT$901 transistor was used with a gm of 0.016 mho. The leakage was 
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O MEAS. TOTAL NOISE 


@ MEAS. FIRST-STAGE NOISE 


ig (LA x 10-2) 





0 1 2 3 4 5 6 7 8 9 10 
Ib IN BA 


Fig. 6—Effective noise current vs base bias current for an FMT4000 bipolar 
transistor. 


under 1 vA, which is negligible for 274 Mb/s. Only about half the 
total noise originated in the GaAsFET. This accounts for a significant 
part of the error. The noise of the later stages could have been reduced 
by the inclusion of an extra stage in the front end. However, the 
performance would still have been inferior to that of a bipolar transis- 
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Table I|— Comparative performance 


Mea- | Mea- Cal- 


: z sured | sured 
Big || Ptane-: Banal Total | First | Cule- 


Rate mitted ized Device ech 0) B | Cr! .FS | Stage tion 





Pulse | Pulse Se 
(Mb/s) Shape | Shape aan V2. dott 
eff (n A) 
(nA) 
6.3 | 50% RZ| RZ SiFET 6 8.0| 0.46 0.40 | 0.39 
50 50% RZ! NRZ | SiFET 6.4 6.7| 4.2 7 3.68 
274 NRZ NRZ | BP 160 | 4 49 42 23 


274 NRZ NRZ | GaAsFET; 16 4 | 79 55 36 


* Not available. 


tor. The theoretical and experimental effective noise currents shown 
in Table II for the 6.3-Mb/s and 50-Mb/s cases are in close accord 
with theory. 


Vi. COMPARISON OF DEVICES 


Several considerations enter into the selection of an input stage for 
an optical receiver. Among these factors are sensitivity, dynamic 
range, power consumption, temperature stability, and cost. Each of 
these factors, with the exception of the last which is beyond the scope 
of this paper, will now be discussed. 

Equations (15) and (20) indicate that the effective noise for bipolar 
transistors and field-effect transistors without leakage have a different 
dependence on frequency. Since the rate of noise increase with fre- 
quency is lowest for bipolar transistors one would expect them to be 
best at high frequencies. At lower frequencies FETs could be expected 
to be superior. GaAsFETs have appreciable leakage currents. With 
the FMT901 transistor the leakage varies between 0.1 and 100 yA, 
though it is typically below 10 uA. Thus the noise performance degrades 
at low frequencies. In addition, 1/f noise may be a problem. To date 
little is known about this source of noise in GaAsFETs and the possi- 
bility of its being significant at bit rates on the order of 10 Mb/s 
cannot as yet be ruled out. 

Figure 7 illustrates the dependence of the noise of SiFETs, 
GaAsFETs, and bipolar transistors as a function of frequency. The 
leakage current noise, shown separately, must be added to the device 
noise to get the total circuit noise. This is especially important for 
GaAsFETs where the leakage of the device can be the limiting factor. 
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Fig. 7—Effective noise current vs bit rate. 


The second case of Table I (50-percent-duty-cycle optical pulses, 
NRZ baseband) is chosen to illustrate the behavior to be expected. 
It is assumed that the device parameters of Table III apply. The FET 
curves terminate at the frequency where g,Z; = 1. From the curves 
for this case bipolar transistors are superior to GaAsFETs above 150 
Mb/s and to SiFETs above 20 Mb/s. If leakage were negligible, the 
GaAsFET would be superior to the SifET. However, since for cur- 
rently available devices the leakage runs from about 0.1 to 100 pA 
it must be taken into account in the determination of the relative 
merit of the GaAsFET. For example, with 0.2 vA of leakage the 
GaAsFET is superior to the SiFET above about 5 Mb/s. Unless the 
leakage is below about 1 pA it is never superior to the bipolar transistor. 
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Table Ill —- Device parameters for Fig. 7 


Device or) ( i) B 
Bipolar Transistors 4 160 
SiFET 8 0.005 
GaAsFET 4 0.016 


The previous cases apply to readily achievable results with com- 
mercial devices. However, these devices are not optimized for optical 
receiver applications. Figure 8 shows the optimum bias as a function 
of bit rate for each of the cases of Table I. The optimum bias current 
is extremely small and for many transistors the beta will start to drop 
before optimum is reached. By reducing the area of devices the range 


™™ 
~~ 


OPTIMUM BIAS CURRENT IN MA 


— CASE IL 





1 2 4 6 8 10 20 40 100 200 400 1000 
BIT RATE IN Mb/S 


Fig. 8—Optimum bias vs bit rate. 
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0 20 40 60 80 100 120 140 160 
FREQUENCY IN MHz 


Fig. 9—Reciprocal gain vs frequency with gain bandwidth product (gm/2aC;) as 
a parameter. 


of operation can be extended. Furthermore, it will reduce the capaci- 
tance. If the stray and detector capacitances can also be reduced an 
additional improvement in performance can be achieved. 

The upper frequency at which an FET can be used is set by the gain 
bandwidth product of the device in the circuit. Figure 9 shows the 
gain versus frequency with gn/2rC; as a parameter. With an inte- 
grating second stage the second-stage noise becomes insignificant if 
the reciprocal gain, 1/gnZ1, at the highest frequency of interest is 
below 0.5 while for the noise from a nonintegrating second stage to be 
insignificant the reciprocal gain must be below about 0.1 at the highest 
frequency of interest. 

For an FET, g, and C, are both proportional to gate length.* 
Increasing gate length increases drain noise (gm/C} < 1/length). On 
the other hand, decreasing gate length reduces the total capacitance 
more slowly than gm due to the detector and stray capacitance and 
thus increases the noise contribution of the succeeding stages. Thus 
an optimum value exists' which can be determined from eq. (15) or 
eq. (14) if the load noise is frequency independent. 


* Gate length refers to the dimension perpendicular to current flow and width to 
the direction of current flow. The former dimension is readily changed. The latter 
can press the technology. 

TG. L. Miller has recently pointed out that C, = C4 is the optimum condition 
when the first-stage drain noise predominates. 
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With improving technology it is expected that the gate width can 
be significantly reduced. Typical commercial silicon devices with 
5-um gate widths have gain bandwidth products of 132 MHz; experi- 
mental devices with 0.5-um gate widths have been reported with a 
gain bandwidth product near 1 GHz. 

We will now direct our attention to the question of dynamic range. 
Without avalanche gain, for an ac-coupled FET the voltage swing 
at the detector for the required minimum signal increases with the 
square root of the reciprocal bit rate. This effect has not been fully 
analyzed and data are not presently available indicating the bit rate 
at which amplitude distortion prevents proper equalization. However, 
at 6.3 Mb/s, problems were not encountered with the signal 10 dB 
above that required for 10~ error rate. 

For a bipolar transistor the signal current can become comparable 
with the bias current leading to gain variations with word pattern 
for an integrating front end. For example, with a —31-dBm signal 
(10-° error rate at 274 Mb/s) without avalanche gain the signal 
current is about 0.4 wA average. Since the optimum bias is near 5 pA 
this is appreciable. The ratio ViPn/ Too is proportional to 1/VC, and 
independent of frequency. Thus the ratio of the required minimum 
signal current to bias current is frequency independent, but can be 
expected to increase as C, decreases. 

If C. could be greatly reduced so that 

Vino > Loo 
then the bias could be turned off and integration would not take 
place. The noise would only be present when the signal was on and 
would be given by 
Bex = 2el signal- 


This expression is identical to the one which applies to ideal photo- 
multipliers and avalanche detectors with an infinite electron/hole 
ionization coefficient and large gain; the required signal is 3 dB above 
the quantum limit. 

Present bipolar transistors have a higher transconductance-to- 
device-current ratio than an FET. Thus, they consume less power. 
With avalanche gain where the required optical power is relatively 
insensitive to amplifier noise (varies at § root), supply power may often 
be the deciding factor. It has recently been suggested that FETs 
have the same limiting transconductance-to-current ratio as bipolar 
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transistors," e/kT. However, as yet this limit is not approached under 
practical considerations. 

Finally, bipolar transistors operated well above the reverse satura- 
tion current have better temperature stability than FETs. In appli- 
cations employing de coupling this could be an important consideration. 


APPENDIX 


The equivalent input noise current spectral density of a cascade 
of circuits is calculated as follows. First, the circuits are partitioned 
in any convenient manner. Next, the output impedance of each stage 
is found going from the input to the output of the cascade since the 
output impedance is a function of the input loading. Next, the contri- 
bution of the equivalent noise current of each stage is referred to the 
input by dividing it by the product of the short-circuit current gains 
of all of the preceding stages. Finally, the individual contributions are 
summed in the square sense, that is, 


n _ i-l 1 
— © t y) 
pee Pa Aa ea 


where A;(w) is the short-circuit current gain of the jth stage and Si, 
is the value of S., for the 7th stage. 

The current gain, output impedance, and contribution to S.q of 
S; and Se (Seqi and S.q2) for the common emitter (source), base 
(gate), and collector (drain) configurations are summarized in Table IV 
in terms of the common emitter (source) parameters. The primed and 
subscripted impedances are the parallel combination of the impedance 


Table IV — Equivalent parameters 








Configuration Equivalent Equivalent Cattanteain 
(Common Noise Due Noise Due A( : Output Impedance 
Terminal) to Si—Seaqr to S2—Seqz © 
Emitter (source) Si S2 z 1 Zz [ ZitZa | 
Z a aa ; =) : Zi+ZotZstomZiZ1 
ati =) Zi+Z. 
Zi+Z2 
Base (gate) Si Z3S2 ZitgnZiZs ZitZstomZ1Zs 
ZitonZZs | Zit+ZatomZZs | Zi +Za-+Zatom%Zs 
Collector (drain) Zi(gmZ,—1)S1 (Z14+Z2)S2 Z2(1+9mZ1) ZA+Z, 
ZigmZit1) | Z(1+omZi) D-+Zi * Zi: 4+Z,4+Zs+0mZiZs 
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with the same subscript and the output impedance of the preceding 
stage. 
When 
Ze > Zi) Z3, Zi a Z3 
and 
Z2 > 1/9 


the common emitter (source) and base (gate) configurations give 
appreciable current gain. Under this condition the approximate 
relations of Table V apply for Seqi, Seq2, A, and Z, emitter (source) 
and base (gate) stages; and for S.qi, Seq2, and A for a common collector 
(drain) stage. Table V also gives approximate parameters for output 
impedance for a common collector (drain) stage for the cases 


JmL1 > 1, Ds > 21 
and 
Zs XK Gm4Z123, ZeK Zi. 


The first case is typical of bipolar and the second of field-effect transis- 
tors. The || symbol represents the parallel combination of impedances. 

Insight into the choice of circuit configuration can be gained by 
examining the results given in Table V when the preceding assump- 
tions apply. If the gain of a single stage is large enough that the noise 
of subsequent stages is negligible, then all three configurations give 
identical results for Seq and Seq2. For this case, the choice of configura- 
tion would be made on the basis of output impedance. Since the 
detector impedance, except at extremely high frequencies, is large, a 
common base (gate) stage can usually be ruled out and the selection 


Table V — Approximate equivalent parameters 





Configuration 
(Common terminal) Sea Seq2 A (w) Zo 
Emitter (Source) Si a, —9gm(Zs||Z1) Zs 
ae PAP ral 2 SOR. 7! 
Base (Gute) Si GaZelZ) 1 GaLiaa 
@alleseor (Dian) Be | peep zat ena 
9m (Zs||Z1) mL 





Pa (Case II) 








1792 THE BELL SYSTEM TECHNICAL JOURNAL, NOVEMBER 1974 


between the common emitter (source) and common collector (drain) 
stage made on the basis of desired output impedance. Parenthetically, 
it is interesting to note that the output voltage for the two cases is 
identical—the first gives voltage gain but the second presents a higher 
impedance to the source. 

For multistage amplifiers a cascode configuration [common emitter 
(source) stage followed by a common base (gate) stage] is often 
chosen for the first two stages. A cascode amplifier has a wider band- 
width than a common emitter (source) stage and thus can simplify 
equalization. However, it does not lead to the lowest total noise since 
A 1 for a common base (gate) stage and therefore the full value of 
Seq for the following stage contributes to the equivalent noise spectral 
density. 

The approach which gives the lowest equivalent noise is a cascade 
of common emitter (source) stages—possibly followed by a common 
collector (drain) stage to match impedance—because common emitter 
(source) stages exhibit the current gain of common collector (drain) 
stages, but present a higher output impedance to the following stage, 
thereby minimizing the second-stage S.q2 contribution to Seq. 
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Reduction of Multimode Pulse Dispersion by 
Intentional Mode Coupling 


By D. MARCUSE 
(Manuscript received May 3, 1974) 


Guidelines for the design of multimode, step-index fibers with intentional 
fluctuations of the refractive index of the core are given, with the aim of 
reducing multimode pulse dispersion. It appears possible to engineer a 
fiber with carefully designed refractive index fluctuations, the azimuthal 
variation of which is governed by the function cos and the z dependence 
of which has a spatial Fourier spectrum with a sharp cutoff frequency. 
By limiting the location of the index fluctuations to a region below a certain 
radius Tmax, coupling to modes with large azimuthal mode numbers can 
be avoided and power loss via coupling to radiation modes can be held to a 


1. INTRODUCTION 


Optical fibers supporting many guided modes suffer from multimode 
dispersion. A pulse launched into a multimode fiber excites many 
modes, each traveling at a different group velocity. At the far end of the 
fiber the pulse is spread out in time because of the different group 
delays of each mode. This multimode dispersion effect 1s usually more 
serious than the single-mode dispersion caused by the dispersive effect 
of the dielectric material of the waveguide core and by the inherently 
dispersive nature of mode guidance. Discussions of multimode dis- 
persion in the absence of mode coupling can be found in Refs. 1, 2, 
and 3. 

S. D. Personick discovered that multimode dispersion in fibers can 
be reduced by intentional (or unintentional) mode coupling. If the 
power carried in the fiber transfers back and forth between slow and 
fast modes, averaging takes place, so that the pulse no longer breaks 
up into a sequence of pulses but is forced to travel at an average group 
delay with a concomitant reduction in pulse spreading. Although the 
spread of a pulse carried by uncoupled modes is proportional to the 
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length of the fiber, it only becomes proportional to the square root of 
its length if the pulses are coupled among each other.3-5 

However, reduction of multimode pulse dispersion by means of 
mode coupling must be bought at a price. Any mechanism that causes 
coupling among the guided modes also tends to couple guided modes to 
the continuum of radiation modes. Power coupled into radiation modes 
radiates away causing losses. Radiation loss can be reduced by careful 
control of the coupling process.® It is possible, at least in principle, to 
provide strong coupling among the guided modes but only very little 
coupling to radiation modes. The loss penalty can thus be controlled 
and kept to small amounts.?° 

Coupling between two fiber modes is caused by a specific spatial 
frequency of the Fourier spectrum of the coupling function. Two modes 
couple via a spatial frequency that is equal to the difference of the 
propagation constants of the two modes. Control of the loss penalty 
for multimode dispersion is thus possible by shaping the Fourier spec- 
trum of the coupling function. In general, it is desirable to achieve a 
spectrum that provides a sufficient number of spatial frequencies below 
a critical frequency and a sharp cutoff of the spectrum at the critical 
spatial frequency.*:5 

In this paper we discuss means of mode coupling by employing in- 
tentional fluctuations of the refractive index of a fiber whose unper- 
turbed core has a constant index of refraction (step-index fibers). It is 
necessary to shape the core-index fluctuations so that only modes with 
adjacent azimuthal mode numbers »v couple to each other. Additional 
control of the coupling process must be provided by a sharp cutoff of 
the coupling spectrum that can be achieved by careful design of the z 
dependence (z is the axial direction) of the index fluctuations. Finally, 
radiation losses can be minimized by limiting the index fluctuations 
to a region near the fiber axis. 

The paper begins with a discussion of the requirements on the Fourier 
spectrum imposed by the desire to minimize the loss penalty. Next we 
provide explicit expressions for the power-coupling coefficients and 
estimate the amount of index fluctuation that is necessary to achieve 
a desired reduction in multimode dispersion. 

This discussion is intended as a guide to the fiber designer, pointing 
out the possibilitics available for reduction of multimode dispersion 
and explaining the difficulties that must be overcome. 


Il. SHAPING THE SPATIAL FOURIER SPECTRUM 


We consider two modes with propagation constants 8; and 8;. Inter- 
action between these modes is described by a coupling coefficient that 
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depends on the distortion of the core boundary or on refractive-index 
irregularities. The azimuthal symmetry of the irregularity provides 
selection rules for the coupling process. The z dependence of the 
irregularity enters the coupling process via its spatial Fourier spec- 
trum. A sinusoidal component of the Fourier spectrum of the form 


F(@) cos 6z (1) 
couples the two modes only if the relation®: 
[8: — 8;| = 8 (2) 


is satisfied. (This requirement stems from first-order perturbation 
theory and is valid provided that the coupling is weak.) 

It is not hard to envision a sufficient number of spatial frequencies 
that couple all modes among each other. However, as pointed out in 
Section I, coupling to radiation modes causes power loss by radiation 
from the fiber core.*:* Jt is thus essential to avoid coupling between 
guided and radiation modes. To see whether this is possible, we must 
study the spacing (in 8 space) between the guided modes. I have com- 
puted the propagation constants of all the guided modes for a step- 
index fiber with 


V = (ni — n§)*ka = 40, (3) 


where ni = refractive index of fiber core, nz = index of cladding, 
k = free-space propagation constant, a = fiber core radius. The prop- 
agation constants were obtained as solutions of the simplified eigen- 
value equation of the optical fiber.13 When the propagation constants 
are listed in order of their numerical values, regardless of mode number, 
they appear approximately evenly spaced. This behavior of the propa- 
gation constants of fiber modes contrasts with the behavior of the modes 
of a dielectric slab. Here we find that the spacings between modes in- 
crease monotonically so that the spatial frequencies, required to couple 
nearest neighbors, also increase.? By providing a cutoff to the Fourier 
spectrum of spatial frequencies contained in the coupling function, we 
can provide coupling among lower-order modes of the slab and uncouple 
either the last guided mode or a few of the higher-order guided modes, 
depending on the value of the spatial cutoff frequency (see Fig. 1). 
Once the highest-order (or a few high-order) guided modes are un- 
coupled from the rest, there is no danger of incurring radiation loss 
caused by the coupling process. Things are not quite that simple in 
the round optical fiber because the modes are not naturally arranged 
with ever-increasing spacings between neighbors. 
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9 max 8 
Fig. 1—Schematic drawing of the desired spatial Fourier spectrum of the z de- 


pendence of the index fluctuations. The shape of the function is unimportant except 
for its abrupt cutoff. 


A good approximation to the actual solution of the eigenvalue equa- 
tion is obtained if we approximate the Bessel function by the formula’ 


ie £2 cos G — y)t — y arccos (2) — al a 


GFF 


In weakly guiding fibers the transverse electric field component can be 
represented as }3 


E, = AJ,(xr) cos (vd)e*8?, (5) 


with 

Kk = (nik? — B*)}, (6) 
To a good approximation, we may assume that H, = 0 at the core 
radius r = a. This approximation is better for modes far from their 
cutoff value but it gives a reasonable indication of the propagation 


constants for practically all modes. An approximate eigenvalue equa- 
tion of the guided modes thus follows from (4), 


[ (xa)? — v2]! — » arccos (=) - i = (2m — 1) 5 (7) 


form = 1, 2,3, ---. By regrouping this equation we obtain a form that 
is useful for iterative solutions, 


2)4 
Ka = {e+ [ (nm — Be + varccos (* J . (8) 
Using (6) and (7) we derive the following approximate expression for 
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the spacing between guided modes, 


Ap 1 
el ae ram — =: (m — 4)r 


eS BL Tea ” 


AB is the spacing between modes that are separated by an amount Ap 
of the azimuthal mode number » and by a change Am of the radial 
mode number m. 

If we place no restriction on the allowed values of Av or Am, it is 
impossible to shape the spatial Fourier spectrum of the coupling func- 
tion so that coupling to radiation modes is avoided. It is thus necessary 
to introduce definite ‘“‘selection rules’ for the coupling among the 
guided modes. We shall see later that it is possible to shape the refrac- 
tive-index distribution or the deformation of the core-cladding bound-' 
ary such that only modes with 


Av = +1 (10) 


can couple to each other. We shall thus assume that the selection rule 
(10) is enforced and continue our discussion on this assumption. The 
allowed values for Am remain arbitrary. However, it is true that the 
spatial frequencies for coupling between modes (that is, the value of 
A) are larger for larger values of Am. Since it is our aim to introduce 
a cutoff frequency into the spatial Fourier spectrum so that modes 
with large spatial frequency separation will be uncoupled, we restrict 
the discussion also to a limited range of values for Am and consider 
only the case 

Am =0 or #1. (11) 
Using (10) and taking Am = 0 we obtain from (9) 
Am = 0 


K? (m — 4)r 





M61 = Sires) or “a, OD 
For Am = +1 we obtain from (10) and (9) 
ne 2B we. Am = —Ap 








The case Am = +Avp has been excluded since it leads to larger spatial 
frequencies than those obtained from (13). 

Figures 2 and 3 illustrate the boundaries of various regions in mode- 
number space v, m. Both figures were drawn for V = 40 [see eq. (8) ], 
mi = 1.515, and ne = 1.5 so that ni/n2 = 1.01. The solid line delineates 
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Fig. 2—Various regions in mode-number space », m. All curves belong to V = 40, 
m2 = 1.5, and mi/n2 = 1.01 with the exception of the solid line labeled (V = 30). 
The solid lines indicate the boundaries of the range of guided modes for the respective 
V values. The broken line labeled Am = 0 delineates the area below which coupling 
of modes with the selection rule Avy = +1 and Am = 0 is possible. The broken line 
labeled Am = — Ay limits the range of coupling with the selection rule Avy = +1, 
Am = — Ay. The dash-dotted line is the limit of the coupling range caused by the 
location of r = rmax = 0.8a. The spatial Fourier spectrum cuts off at Omaxa = 0.15. 


the boundary of guided modes; it is obtained by plotting those values 
of v and m that result in xa = V [see eq. (14) ]. All guided modes are 
located to the left and below the solid line. The broken lines* represent 
lines of constant spatial frequency. They result from plotting the com- 
binations of v and m values that result in AB = 6max. The line labeled 
Am = 0 was computed from (12) and the line Am = —Ayv was ob- 
tained from (18). The broken lines delineate the boundaries for mode 
coupling with the spatial Fourier spectrum of the coupling function of 
Fig. 1, the cutoff frequency of which is @ = @max. Modes below and to 
the left of the broken lines couple to their nearest neighbors via the 
selection rule Avy = +1 and Am = 0 or Am = —Av. Modes located 
to the right and above the broken lines cannot couple to each other 


“The meaning of the dash-dotted lines and the solid line labeled (V = 30) will be 
explained later. 
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Fig. 3—Same as Fig. 2 except that Omaxa = 0.165, rmax = 0.5a. 


since no spatial frequencies achieving this coupling are available. 
Figures 2 and 3 differ only in the choice of the cutoff frequency @max. 
There are no transitions with Avy = +1 that couple via smaller spatial 
frequencies than the ones indicated in the figures. It is thus apparent 
that it is possible to provide coupling among most guided modes by 
means of the Fourier spectrum shown in Fig. 1. However, modes to 
the right and above the uppermost broken line remain uncoupled. It 
is necessary to uncouple a few higher-order modes in order to avoid 
coupling into the continuum of radiation modes. The conditions shown 
in Fig. 2 achieve this goal almost completely. Only the mode m = 1, 
vy = 34, lying on the boundary of the guided-mode region, is coupled 
to radiation modes as well as the other guided modes. Power is thus 
able to flow out of the guided-mode region causing radiation losses via 
this one guided mode. This power loss could be avoided by decreasing 
Omax- 

The conditions prevailing in Fig. 3 would result in a high loss penalty 
since all modes with m < 4, v > 21 on the boundary of the guided- 
mode region couple to guided as well as radiation modes. However, we 
shall show later that it is possible to prevent mode coupling for modes 
exceeding a certain maximum » value that can be chosen by a suitable 
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design of the intentional index fluctuations. It is thus possible to 
achieve a low loss penalty even for the conditions shown in Fig. 3 
provided the coupling mechanism is carefully designed to avoid cou- 
pling for modes with v > 20. 

Modes located between the two broken lines in Figs. 2 and 3 can 
couple only to their neighbors above and below in the mode-number 
plane. However, since the members with low »v values below the line 
Am = —Ayr are able to couple to their neighbors to the left and to the 
right, all modes below the uppermost broken line are actually coupled 
together. 

We can give an approximate rule for calculating the spatial cutoff 
frequency appearing in Figs. 2 and 3. We begin by specifying the 
maximum v value on the mode boundary for which mode coupling 
should cease. As mentioned earlier, the design specification for this 
value ymax Will be given in the section on mode coupling. Next we need 
to know the corresponding value of m on or near the mode boundary— 
the solid line in Figs. 2 and 3. We obtain it from the cutoff condition 
xa = V and (8), 





Mae = g— "BA arcoos ( THE) + (V8 — vba (14) 
Substitution of ymax and mMmax into (12) using 6 = nek yields the 
desired value for AB = O0max. For V = 40 and ymax = 20 we obtain 
from (14) mmax = 4.61 and from (12) @maxa = 0.17 in agreement with 
Fig. 3. Of course, it does not make physical sense to use a noninteger 
Mmax, but it is advisable to use this value in (12) in order to obtain a 
more accurate value of @max.- Incidentally, (14) defines the mode 
boundary if we use it for all possible values vy = ymax. 


Ii, POWER COUPLING COEFFICIENTS 


Mode coupling in multimode dielectric optical waveguides is most 
conveniently described by a coupled-mode theory. The power coupling 
coefficients are defined as follows :*+5 


honsum = (Soni unl? )s (15) 


The symbol ( ) indicates an ensemble average. The coefficient Kyn,um 
stems from the coupled amplitude equations and is defined as® 


Ka. ee |" ag | rar(n? et ee (16) 
yn, um 4iP ; 0. vn um 


The angular frequency of the radiation is w, €) is the dielectric permit- 
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tivity of vacuum, and P designates the power normalization constant 
of the modes, the electric field vectors of which are indicated by script 
letters. The refractive-index distribution of the actual guide with index 
fluctuations is n while mp» indicates the index distribution of a perfect 
guide from which the actual guide deviates only slightly. Our interest 
is focused on introducing intentional index fluctuations for the purpose 
of mode coupling. We are thus free to choose n to achieve our goal. 
It was pointed out earlier that coupling to radiation modes is unavoid- 
able unless certain selection rules are imposed on the coupling process. 
Our discussion in the last section was based on the selection rule 
Av = +1. To achieve this selection rule we must require that the 
refractive-index distribution be of the following general form: 


n? — ng = 2nAng(r) f(z) cos ¢. (17) 


We know from earlier work that the ¢-dependence of this index dis- 
tribution leads to the desired selection rule.? 

We found in the preceding section that it is also desirable to avoid 
coupling among modes with large » values. It follows from the proper- 
ties of Bessel functions that the field intensity of the transverse field 
components is very weak for radii that obey the relation 


Kr <p. 


This result is easily interpreted in terms of ray optics. Modes with 
large values of v are represented by skew rays that spiral around the 
fiber axis. These rays avoid the vicinity of the waveguide axis and stay 
nearer to the core boundary for larger values of v. The radius defined 
by xr = vy represents the turning point below which a ray with a given 
value of » does not penetrate. The coupling formula (16) shows that 
mode coupling depends on the field strength at the point where the 
index irregularity is located. By providing refractive-index variations 
that do not extend beyond a radius 7max, it is possible to limit mode 
coupling to modes whose » values remain below a maximum value 
near the mode boundary that is defined as 


Tmax 
Vmax = V re (18) 
xa has been replaced by its maximum value V. The values of ymax and 
the corresponding values for mmax and Omax defined by (14) and (12) 
determine the position at which the dotted curves labeled Am = 0 
cross the solid curves in Figs. 2 and 3 that define the guided-mode 
boundary in mode-number space. Coupling of guided modes to radia- 
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tion modes can be avoided by limiting the v values of those modes that 
are coupled by index fluctuations. If the intentional index fluctuations 
do not extend beyond the radius 7max, coupling is restricted to those 
modes that remain below a boundary defined by the equation 


PS eae: (19) 


If we combine this equation with formula (8) we obtain the function 
m = m(v) that defines the boundary in mode-number space beyond 
which mode coupling ceases, because the index fluctuations are re- 
stricted to radii r S rimax, 


1 v a 3 Tmax 
mazt2 (3 -1) — arcoos Znes |. (20) 


Tmax 





This boundary is shown as a dash-dotted line in Figs. 2 and 3. We now 
have the means of providing coupling among all guided modes that 
remain inside the nearly triangular areas that are bounded by the 
dotted lines labeled Am = 0 and the dash-dotted lines in Figs. 2 and 3. 
If these areas remain below the guided-mode boundary (the solid line 
labeled V = 40 in Fig. 2), coupling to radiation modes can be avoided. 

After this digression into the fundamental properties of mode cou- 
pling we proceed with the derivation of specific coupling formulas. 
Substitution of the field vectors, given by (5) and in more detail by 
Ref. 8, into (16) we obtain with the help of (17) 


__ Bretntt® f rg(r)J (kent) J u(Kymr)dr 
The parameter 7, is defined as 
Yon = (Bin — nzk?)? (22) 


and 6,, is Kronecker’s delta symbol. 
Of the many possible choices for the function g(r) we use only two 
examples that may be of particular practical interest. First we use 


g(r) = Wé(r — rmax)- (23) 


W is the very narrow width of the ring of index fluctuations. We sub- 
stitute this equation into (21) but proceed immediately to the power 
coupling coefficient (15). The Fourier spectrum is defined as 


F(6) = lim = f ” F(z)e-idz (24) 
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and the power coupling coefficient for a narrow, ring-shaped refractive- 
index fluctuation is given as 
BAW RR Ry aad 6 Kea hae) Ka aee) 2 


= 2 
Visitas V2I ya (Kyn@)S p—1(Kym@) (\F| Ort, m- (25) 





Because the relation 
J,(Ky»nt) 0 (26) 


is approximately valid we have used 
J y4.1(Kyn) = —J,-1(KynQ). (27) 
As a second example we consider the function 


1 forr < tmax 


QO forr > fmax- (28) 


= 
Limiting the index fluctuations to a wide range, 0 < 7 < rmax. In this 
case the power coupling coefficient assumes the form 


DONT aroun Vien | Mand » Wont as) d iat Kat ae) 
Vie sien res ca gid A Koa inas) da Kat wae): | 
Vl —= Kean Pei Runt pea (Kisinh) 
CLF? )6,41,40 (29) 
The argument of the power spectrum (|F?|) in (25) and (29) is 
Bon =e Bienes 





IV. PULSE WIDTH REDUCTION BY INTENTIONAL MODE COUPLING 


Random coupling of the modes of a multimode fiber causes the many 
pulses, traveling on different guided modes at different group velocities, 
to be coupled together so that an equilibrium pulse establishes itself 
traveling at an average group velocity. Its ensemble average has a 
gaussian shape’ the width of which is given by the formula” 


T = 4(p2L)}, (30) 
with" 
« [2 2%(4-2) 29 | 
y= Vv, Va 
p= BP oy 


Equation (30) shows that the width of the pulse grows proportionally 
to the square root of the length L of the fiber. The parameter p2 is the 
second perturbation of the first eigenvalue that results from an alge- 
braic eigenvalue problem, Bf are the components of the 7th eigen- 


i 
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vector, p§” is the ith eigenvalue of zero order, and v, and », are the 
group velocity of mode v and the average group velocity. 

The evaluation of this expression is difficult. If only a few guided 
modes exist, computer solutions of the eigenvalues and eigenvectors 
can be obtained. If many modes are guided, it is possible to convert 
the algebraic eigenvalue problem to a partial differential equation, 
provided that it can be assumed that only nearest neighbors couple 
among each other.?!? For modes close to the edge of the coupling 
range, indicated by the dotted lines in Figs. 2 and 3, the assumption of 
nearest-neighbor coupling is well justified since the spatial Fourier 
spectrum of the coupling function lacks the higher frequencies re- 
quired to couple a mode to a neighbor farther away. However, the 
lower-order modes are crowded more closely so that there are spatial 
frequencies available for coupling to modes other than the ncarest 
neighbors. This increase in coupling strength is partially compensated 
for by the fact that modes above the dotted line labeled Am = — Av in 
Figs. 2 and 3 can couple only vertically. In order to be able to give an 
order-of-magnitude estimate of the index fluctuations required to 
achieve a certain reduction of the pulse width, we shall assume that 
nearest-neighbor coupling can be assumed and provide an expression 
for the pulse width reduction that may be regarded as a crude approxi- 
mation. 

We use the theory presented in Ref. 3. The only modification neces- 
sary to the formulas presented in Section 5.6 of Ref. 3 consists in the 
realization that our coupling scheme avoids radiation losses, so that 
the highest-order members of the group of coupled modes are not 
depleted contrary to the assumption in Ref. 3. The eigenvectors and 
eigenvalues defined on p. 235 of Ref. 3 can be used, except that the 
first eigenvector is now constant, independent of the mode number. 
All eigenvectors are mutually orthogonal and properly normalized. 
Using the procedure explained in Ref. 3 we obtain the result 


_ T _ 0.225V. 
~ Ar (Zh)? 


T is the full width at the 1/e points of the gaussian-shaped pulse, Ar 
is the width of the pulse train that, would exist in the absence of cou- 
pling, V is the normalized frequency parameter defined by (3), L is 
the length of the fiber, and h the power coupling coefficient. It was 
assumed that only nearest neighbors couple to each other and the 
strength of these coupling coefficients was assumed to be identical for 
all the modes. 





(32) 
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R is the improvement factor that indicates the reduction of the 
length of the pulse relative to its uncoupled length. It is thus desirable 
to achieve as small a value of RF as possible. Values of R > 1 do not 
describe a physically meaningful situation. The formula may result 
in values R > 1. This indicates that the coupling is not strong enough 
to achieve an equilibrium pulse in the length of fiber available. 

We are discussing numerical values for the two types of couplings 
described by (25) and (29). Values have been computed for 


> = Von@ s41.mOd o (Kent max)d 41 Kyi il nae) ; 
Nisan a — ~ Wileiivtesiwe (33) 

and 

: Vea Vari nOl Kovlawd 1 Kia nax)d 0 Kyi inl ax) 

Ayn: v+1,m >= — Kid A (Kent aaa) ed 94 RA ae mas) | (34) 

a(Ken =s Kat md p—1(Kyn@)ed »(ey41,mB) 
for all the modes of a fiber with V = 40, n1 = 1.515, and m1/ne = 1.01. 
A few sample values are listed in Tables I and II. The values listed 





Table |— Sample values of normalized coupling coefficients 
for a narrow ring of refractive-index fluctuations 
located at r = max With rnax/a = 0.8 


v n Visacvtie henjreijn-1 
1 1 2.212 108 0. 
1 2 4.608 106 3.987 105 
1 3 1.023 108 2.879 106 
1 4 1.287 103 7.733 104 
1 5 4,780 10° 5.678 108 
1 6 3.844 106 1.931 106 
1 7 2.726 10° 4.107 106 
1 8 7.018 104 8.497 105 
1 9 2.262 104 5.610 103 
1 10 2.431 106 5.888 105 
1 11 4.236 106 4.181 106 
1 12 3.964 105 2.115 108 
10 1 4.945 107 0. 
10 2 7.071 108 1.981 105 
10 3 1.597 107 8.981 105 
10 4 1.517 107 1.841 107 
10 5 1.321 105 2.600 106 
10 6 1.467 10° 1.451 105 
10 7 3.179 107 9.235 106 
10 8 4.896 107 4.112 10? 
20 1 5.807 107 0 
20 2 3.229 108 1.227 108 
20 3 4,017 107 1.860 108 
20 4 4.954 107 2.556 107 
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Table !1— Sample values of normalized coupling coefficients 


for a wide band of refractive-index fluctuations extending 
from r= 0 to Fr = rmax With Pmaxs/a = 0.8 








v n Ayn; p+i,n Ayn; y+1,n—1 
1 1 6.078 104 0. 
1 2 1.141 105 1.349 10! 
1 3 8.734 104 6.439 104 
1 4 4.845 10! 8.157 104 
1 5 4.439 104 5.351 104 
1 6 6.481 104 3.728 104 
1 7 7.628 104 4.844 10! 
1 8 6.336 104 6.806 104 
1 9 5.318 104 6.709 104 
1 10 6.474 104 5.431 104 
1 11 8.767 104 5.794 104 
1 12 1.136 105 8.628 104 
10 1 1.817 105 0. 
10 2 5.642 104 8.361 105 
10 3 1.215 105 2.730 105 
10 4 1.652 105 3.173 105 
10 5 1.377 105 4.373 105 
10 6 1.627 105 4.063 105 
10 7 3.429 105 4.248 105 
10 8 1.299 106 9.413 105 
20 1 7.904 104 0. 
20 2 5.450 105 1.264 106 
20 3 6.717 105 4,260 106 
20 4 2.118 108 7.286 106 


in Table I fluctuate because the narrow band of refractive-index varia- 
tions may occasionally find itself located near a node of one of the two 
field functions so that the coupling coefficient can even vanish for a 
certain pair of modes. For this reason it is advisable to provide at least 
two bands of the kind (23) at different radii. The coupling coefficients 
for the second case, listed in Table II, corresponding to a wide band of 
refractive-index fluctuations, show far less variations. or want of a 
better procedure we use average values of the coupling coefficients in 
(32). It may be expected that the low values of the coupling coefficients 
determine the rate at which power is exchanged among the modes. On 
the other hand, we know that more than just nearest neighbors couple 
for low values of » and n. In order to achieve an order-of-magnitude 
estimate we use the arithmetic mean of the entries in the first block of 
data in the tables for y = 1 and obtain for the average value of the 
coupling coefficient h from Table I and (25) 


Ankrnax \2 W? 
h = 2x 108 ( Sttoen ) a (IF). (35) 
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Likewise, we find from Table II and (29) 


. Ankrmax \* 3 
b= 7x 10 ( Steer)" (1/2), (36) 
It is clear that (35) and (36) hold only for the special case V = 40, 
my = 1.515, and ni/nz = 1.01. These values correspond to a fiber with 
radius a = 30um if the free-space wavelength is assumed to be 
Ao =] um. 

We are now asking for a pulse width reduction of R = 0.1 in a 
fiber whose length is Z = 1 km. From (382) and (36) we obtain with 
Tmax = 0.8a 


(An)?(|F'|?) = 4 & 1077a. (37) 
The value obtained from (32) and (35) is identical with (387) if 
W/a = 0.19. (38) 


It is apparent from (37) that it is the product of (An)? with the ampli- 
tude of the spatial power spectrum [of the z dependence of the index 
fluctuations f(z)_] that determines the effectiveness of the index fluc- 
tuations for reducing pulse dispersion. We relate these quantities to 
the rms variation of the refractive index as follows. For slight index 
differences we have 

n? — ng = 2n1(n — No). (39) 


Thus we obtain from (17), using g(r) = 1 according to (28), 
((m — 1o)?) = 3(An)?(f?(z)). (40) 


The average ( ) includes in this case also averaging over cos? ¢. The 
variance (f?) is related to the power spectrum by the equation 


(pr) =< f° ([F@2|)a0 = "2% (| F|). (41) 
T 0 Tv 

On the right-hand side of this equation we introduced the average 

value of the amplitudes of the spatial power spectrum (|F?|) and the 

spatial cutoff frequency @max- If we interpret (|F?|) in (87) as the 

average amplitude appearing in (41) we obtain from (87) through 

(41) 


[((n — 1o)?) ]* = 2.52 X 10-*(Omaxa)?. (42) 
With Omaxa = 0.15 of Fig. 2 we obtain finally 
L((n — 10)?) }# = 10~. (43) 
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For our specific example, an rms deviation of this magnitude is re- 
quired to achieve a relative pulse width improvement of R = 0.1 (a 
ten-times-shorter pulse of coupled modes compared to operating with- 
out mode coupling). The spatial Fourier spectrum is assumed to be 
flat from zero spatial frequencies to a cutoff spatial frequency of 


Omax = S = 50 cm”. (44) 


The shortest period appearing in the Fourier spectrum is thus 


A= = 0.13 em. (45) 





The index fluctuation of the narrow ring defined by (23) is of the same 
order of magnitude as (43) if the relation (88) holds. However, one 
narrow ring causes gaps in the coupling process for those modes whose 
nulls coincide with the position of the ring. It is thus advisable to use 
at least two rings. 


V. SUGGESTIONS FOR THE DESIGN OF INDEX FLUCTUATIONS 


The spatial Fourier spectrum with a sharp cutoff frequency shown in 
Fig. 1 can be generated by passing noise through a low-pass filter. 
Another method of producing the desired index fluctuations consists 
in superimposing a number of sinusoidal variations. From (17), (28), 
and (39) we have 

nN — No = Anf(z) cos ¢. (46) 


If f(z) is a superposition of sine waves with random phase we have 
M 
Nn — N = An > sin (Q,2 + v.) | COs ¢. (47) 
v=1 


The y, are random phase functions of the individual sine-wave com- 
ponents. As a practical matter, it is probably easiest to generate these 
random phases by letting the phase stay constant over a distance D 
at which point it makes a random jump to another constant value. 
We assume for the purpose of our analysis that the phases are uncorre- 
lated among each other. 

It can be shown that the power spectrum of the function 
sin (0,2 + y,) may be approximated by (see appendix) 


sin? (@ — Q,) . 
(| F,(8) |?) = -8,D (48) 
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The total power spectrum of the function f(z) is thus 


ye Sin? (@ — Q,) 5 
(| F(@) |?) = 20-8) (49) 


D is the correlation length of the phase functions. In the case that the 
phases stay constant and jump randomly to a new value after a dis- 
tance D, this distance is identical to the correlation length. The full 
width of the spectrum (48) is given by 


4r 
A@ = D’ (50) 
A®é is the distance between the first two zeros of the (sin x)/x function 
on either side of its main maximum. Since we need to fill a spectral 
region of width @max we need a total number of 


OmaxD 


dn (51) 


M = 
sinusoidal components. If we use D/a = 1,000 we need with @mnaxa 
= 0.15, M = 12 sinusoidal components in (47). 

The necessary amplitudes An of the sinusoidal index variations are 
obtained from (87). Since the spectral components in (49) overlap 
only slightly, we may use 


(IF) = 2 (52) 


at the peak of each sinusoidal contribution. Using the numerical value 
of (87) we thus have 


An = 1.3 X 10-3 (5): (53) 


With D/a = 1,000 we would thus have An = 4 X 107°. 

It is clear from our treatment that the numbers given here are only 
valid to an order of magnitude because of the many approximations 
that were made for their derivation. 

The implementation of this prescription for the desired index fluc- 
tuations to the design of a fiber is not a trivial matter. 

If the refractive-index increase of the core material is achieved by a 
doping process, it may be possible to program the doping procedure to 
result in the desired fluctuations. Some processes add the dopant in the 
gaseous phase to the material of the fiber preform. In this case it may 
be possible to control the flow of dopant at a predetermined rate that 
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is synthesized as the superposition of sine waves shown in (47). If the 
fiber preform is treated in this way, it would be necessary to compress 
the periods of the sine waves in such a way that the desired periods 
result after the drawing process. As mentioned earlier, the desired 
Fourier spectrum can also be derived from filtered electrical noise 
signals. 


Vi. DISCUSSION 


We have studied the possibility of reducing multimode pulse dis- 
persion by means of introducing refractive-index fluctuations into the 
core of the fiber. Several requirements must be imposed. The coupling 
process must be able to couple all modes of as large a mode group as 
possible among each other without coupling to the modes of the con- 
tinuous spectrum. To achieve this, it is necessary to limit coupling to 
the modes inside of the area of guided modes in the mode-number 
plane. Modes near the outer edge of this range should remain un- 
coupled to avoid radiation losses. This goal can be achieved only by 
designing the coupling mechanism so that a definite selection rule is 
imposed. For simplicity we considered index fluctuation with an 
azimuthal dependence of the form cos ¢. This azimuthal index dis- 
tribution ensures that a mode with azimuthal mode number » couples 
only to modes with » + 1 and »v — 1. Once this selection rule is im- 
posed, it is possible to limit coupling to modes inside of an area in 
mode-number space not including its outer edge. The boundary delin- 
eating the area of coupled modes in the mode-number plane tends to 
cross the boundary enclosing all guided modes at large values of ». 
Thus the danger exists that power outflow to radiation modes occurs 
via modes with large v values. This remaining problem can be avoided 
by limiting the intentional index fluctuations to a region inside the 
fiber core that remains below a certain radius r < rmax < @. 

The spectrum of spatial frequencies of the z-dependent part of the 
coupling process must extend from (essentially) zero to a cutoff value 
Omax- A prescription for finding @max Was given in the section on the 
spatial Fourier spectrum. In principle, this spectrum can be syn- 
thesized as a superposition of sinusoidal components with random 
phase. This problem is discussed in the section on design suggestions 
for index fluctuations. 

Our procedure leaves one question unanswered: What happens to 
those modes that remain intentionally uncoupled, don’t they broaden 
the impulse response of the fiber? These modes may be naturally more 
heavily attenuated than the coupled modes because of their proximity 
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to the cutoff region in mode-number space. However, if the naturally 
occurring losses are insufficient to suppress these modes, steps must be 
taken to filter them out. This filtering is possible either by constricting 
the fiber at the receiver or by using spatial filtering of the radiation 
escaping from the end of the fiber prior to entering the detector. A 
reduction of the fiber diameter reduces the V value of eq. (8). Figure 
2 shows a solid line labeled (V = 30). In the constricted fiber this line 
becomes the new mode boundary. It is clear that the constriction re- 
moves all the uncoupled modes in addition to some of the coupled ones. 
An additional amount of loss must be tolerated as payment for the 
pulse cleaning operation. The total number of guided modes supported 
by the fiber is given as!3 

y2 
a 
As the V value is reduced from 40 to 80 the mode number drops from 
800 to 450. We lose roughly half the power in the attempt to clean up 
the pulse distortion that results from the free-running, uncoupled 
pulses. However, this 3-dB loss penalty is independent of fiber length 
and may be well worth paying in return for a considerable reduction 
of pulse dispersion. The conditions of Fig. 3 are far less favorable, 
here about 75 percent of the power would be lost. 

Practical implementation of these ideas will tax the ingenuity of the 
fiber designer. Introducing desired index fluctuations with the re- 
quired sharp cutoff of their spatial Fourier spectrum is an exacting 
requirement. The cutoff frequency @max must be kept to within a few 
percent. This means that the spectrum must terminate with a sharp 
slope. If the spectrum is synthesized as a superposition of sine waves 
we must require that 


Aé Ar 1 
i Dba < 0.01. (55) 
The number 0.01 was chosen arbitrarily, but it is certainly of the correct 
order of magnitude, A@ was obtained from (50) and 7 from (51). This 
estimate shows that approximately 100 sinusoidal components are 
required to ensure that the spectrum has a sufficiently steep slope at 
its cutoff point. Of course, this requirement could be somewhat relaxed 
by allowing the sinusoidal components deeper inside the spectrum to 
be broader than those near its edge. These considerations may enter 
into the compromise that the designer wishes to achieve. 
The mode spacing (in 8 space) changes as the fiber is bent. The 
modes of a dielectric slab waveguide have smaller mode spacing in 


Ni = (54) 
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circularly bent sections of the guide. In principle, this effect could alter 
our conclusions regarding the possibility of uncoupling high-order 
modes near cutoff by limiting the width of the spatial Fourier spec- 
trum of the coupling function. However, for the numerical example 
used in our discussions, the change in mode spacing due to waveguide 
curvature is unimportant for bent slabs whose radii remain about the 
limit 
R, > 1,000a. 

For a = 30 um, R,. would have to be kept larger than 3 cm. It is not 
known if the numbers for the round fiber would be identical to the slab 
model, but one may expect that waveguide curvature is unimportant 
if the radius of curvature remains larger than a few centimeters. 


APPENDIX 
We provide a simple derivation of eq. (28). Let 
f(z) = sin [Oz + y(z)]. (56) 


The phase function y(z) is assumed to be constant over a distance D 
but jumps randomly to a new constant value at the end of each inter- 
val. The Fourier transform of f(z) is 


F(0) = 1 = (m+1)D é on 
( = =| sin [Qz + y, Je z 


ident sin | (2 -a2 
a tWnet(Q—6) (n+9)D ED 
ou i@— 0) vn 


A term with Q + @ has been neglected since its contribution for large 
values of D is negligible. It is also assumed that L is an integral mul- 
tiple of D, 

L = mD. (58) 


When we form the absolute-square value of F and take the ensemble 
average, all cross terms in the resulting double summation vanish 
because of our assumption of random values for Y,. The phase terms 
in the remaining terms of the summation reduce to unity by the process 
of taking the absolute-square value. With no summation index left 
under the summation sign, the sum results simply in a factor m. Using 


(58) we finally obtain 
sin? | @ — Q) = | 
(| F(6) |?) = (eas (59) 
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An outage of the Plano, Illinois, to Cascade, Iowa, link of the L4 
coaxial cable occurred at about 2240 UT on 4 August 1972 during a 
large geomagnetic storm. The available geomagnetic data measured in 
North America, as well as data received from two satellite instruments, 
are analyzed. These data show that, at the time of the L4 outage, the 
boundary of the magnetosphere was pushed to unusually low altitudes 
by a greatly enhanced solar wind. As a result, large, rapid changes of the 
earth’s magnetic field strength were observed over North America. It 1s 
demonstrated that the field changes at about 2241 to 2242 UT were of 
such magnitude as to induce earth currents of sufficient strength to produce 
the L4 outage by causing a high-current shutdown of the system link. 
The geomagnetic disturbances that produced the shutdown were not of 
the auroral-electrojet type normally associated with disruptions of power 
systems. 


I. INTRODUCTION 


The solar and geomagnetic disturbances resulting from solar active 
region 11976 were truly outstanding in many regards. The principal 
solar region of the activity was in the highest solar activity class on an 
absolute scale; this major solar activity occurred during the declining 
phase of the 11-year solar cycle.! The solar disturbances, propagating 
outward into interplanetary space, produced the largest galactic 
cosmic ray decrease on record.? The geomagnetic storms (with ac- 
companying ionospheric and auroral disturbances) resulting from the 
interaction of the propagating solar disturbances with the earth’s 
magnetosphere were the most severe recorded in well over a decade. 

The interaction of the greatly enhanced solar wind’ with the earth’s 
magnetic field on 4 August 1972 produced extreme compressions and 
distortions of the magnetosphere. It was during the period of the most 
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severe magnetospheric distortion that an outage of the L4 coaxial 
cable carrier system occurred over the link from Plano, Illinois, to 
Cascade, Iowa. This paper, utilizing most of the available North 
American geomagnetic data, describes the geophysical occurrences 
and conditions at the time of the L4 outage. These data, plus magnetic 
field data from two satellites, demonstrate that the L4 outage was 
associated with the extreme compressions and distortions of the 
magnetosphere and not simply with greatly enhanced auroral currents, 
as are often assumed in discussions and models of magnetic storm- 
induced power-system disruptions. The North American geomagnetic 
data are used as input to a model for the calculation of currents induced 
in the earth by the geomagnetic disturbances. It is shown that these 
earth currents were sufficient to produce a shutdown of the L4 system. 


Il. GEOMAGNETIC OCCURRENCES 


The subsequent discussions of the available geomagnetic data 
during the August 1972 storm period must be placed in the perspective 
of normal magnetospheric conditions. Figure 1 is a view of the earth’s 
magnetosphere in the equatorial plane. The distance from the earth’s 
surface to the magnetospheric boundary is approximately 9Rz (1Rez 
=~ 6.5 X 10% km) for normal solar wind density conditions of 2 5 
protons cc“. Figure 1 also shows the circular, earth-synchronous 
satellite orbit at an altitude of %5.5Rz, which is occupied by all 
common-carrier communications satellites. For later reference, the 
location of the NASA synchronous altitude Applications Technology 
Satellite 5 (ATS 5) at 2200 UT is indicated. Also indicated, for the 
same UT, is the apogee location of the near-equatorial, elliptical-orbit 
satellite Explorer 45.5 

The approximate geomagnetic latitude and longitude of Plano, 
Illinois (50.8°N, 336.8°E) are shown on the earth in Fig. 1 for 2200 
UT. Also indicated on the earth are the locations of several North 
American magnetic observatories: Meanook (61.81°N, 301.07°E), 
Churchill (68.69°N, 322.63°E), and Ottawa (56.80°N, 351.52°E). 
All geomagnetic stations used in this study are listed in Table I, 
together with their coordinates. 

Acquisition of sufficient geomagnetic data for a definitive analysis 
of the geomagnetic effects of the storm in North America was made 
difficult by the inability of most magnetic observatory instruments 
to record the very large and rapid variations that occurred during 
the hour 22 UT on 4 August. Because of the instruments’ limitations, 
it is likely that many of the actual variations were larger than those 
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Fig. 1—View of earth and magnetospheric configuration in equatorial plane under 
normal solar wind flow conditions. 


used in this paper.® Indeed, for the analysis of the 20-minute interval 
(2230 to 2250 UT, 4 August), around the time of the L4 outage, data 
at one-minute time intervals could be scaled from the continental 
U. 8S. and Alaska standard observatory chart records for only the 
observatories at College, Sitka, Tucson, and Fredericksburg. Scalings 
of 24 minutes were obtained from observatories at Castle Rock and 
Dallas. Scalings of 1 minute were obtained from a special National 
Oceanic and Atmospheric Administration station near Boulder. Fortu- 
nately, the Earth Physics Branch of the Department of Energy, Mines, 
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Table | 


Geomagnetic Coordinates 
Geomagnetic Station 


Lat(°N) Long (°E) 

Baker Lake 73.74 315.31 
Boulder 48.85 316.44 
Cambridge Bay 77.7 300.3 

Castle Rock 43.48 298.62 
College 64.66 256.51 
Dallas 42.96 327.75 
Fredericksburg 49.55 349.84 
Fort Churchill 68.69 322.63 
Meanook 61.81 301.07 
Ottawa 56.80 351.52 
Sitka 60.00 275.34 
St. Johns 58.50 21.24 
Tucson 40.03 311.41 
Victoria 54.08 293.04 


and Resources in Ottawa was operating a number of digital-recording 
magnetometers in Canada during 1972.6 These data have greatly 
facilitated our analysis of the magnetic disturbances. Most of the 
available U. S. magnetometer data were obtained from the World 
Data Center A and were scaled for us by the Geophysical Institute of 
the University of Alaska. 

Plotted in Fig. 2 are the magnetic variations measured in the north- 
south (H or X) and east-west (D or Y) orthogonal directions at the 
three Canadian observatories, Meanook, Churchill, and Ottawa, 
during the time interval 2200 to 2300 UT on 4 August. At the bottom 
of both sets of data is plotted the magnetic field trace of the Z-com- 
ponent recorded by a magnetometer on the ATS 5 satellite.’ The 
Z-component of the magnetic field at the ATS 5 location is measured 
parallel to the earth’s spin axis. 

The data plotted in Fig. 2 show particularly large field changes 
occurring during the several-minute time interval following 2240 UT. 
For example, at about 2242 UT, the field change AH/At measured 
at Meanook was 1800 y/min. (ly = 10-5 gauss). The field data 
from the ATS 5 satellite are particularly interesting in that, at 2 2225 
UT and 2240 UT, the field direction reversed. At 2241 UT, the 
field intensity was ~ — 4007; i.e., the measured field pointed in a 
direction opposite to that of the normal dipole field. At 2242 UT, 
the field direction at ATS 5 suddenly became normal; at this time, 
AZ/At at ATS 5 was & 575 y/min. 
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Fig. 2—Magnetic field variations observed at three Canadian observatories 
(Meanook, Churchill, and Ottawa) in north-south and east-west directions during 
the hour 2200 to 2300 UT on 4 August 1972. Plotted beneath each set of data are the 
variations in the field observed near the equator on the ATS-5 spacecraft. 


Reversals of the earth’s field direction, similar to those shown in 
Fig. 2, have been observed in the past by magnetometers on the 
ATS 18 and ATS 59 satellites. These occurrences have been attributed 
to a movement of the magnetopause to a location inside the orbit of the 
synchronous satellite, which then measures the magnetic fields in the 
earth’s magnetosheath region. The complex particle and field changes 
that occur in the magnetosphere during such a ‘“‘boundary-crossing” 
event have been discussed in a series of papers devoted to extensive 
study of one such event.®-1°—8 
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The distortions of the magnetospheric boundary as evidenced by 
the boundary-crossing event observed on ATS 5 on 4 August 1972 
were accompanied by the large geomagnetic field changes observed 
on the ground in North America. It is interesting to note that the 
study of several boundary crossing events on ATS 5° suggested that 
the largest effects were observed on the ground when the magneto- 
sphere distortion occurred in the local afternoon side of the magneto- 
sphere rather than in the local morning side of the magnetosphere. 
For the 4 August event, no local morning data are available from a 
synchronous satellite. However, the magnetosphere distortions that 
were observed were in the local afternoon sector. 

The one-minute values of the magnetic field changes [H(X) and 
D(Y) components] that could be determined from North American 
observatory data were used to make contour maps of the field changes 
at one-minute intervals in the U. S. and Canada. These contours 
for 2238 to 2243 UT are plotted in Figs. 3a and 3b in a geomagnetic 
coordinate system. Because of the spread in distance between observa- 
tories and the impossibility of obtaining a reliable magnetic field 
reading from the chart records of several observatories, the locations 
of some contour lines in Figs. 3a and 8b are, at best, extrapolations. 
Yor lack of a better justified procedure, the contours have been con- 
structed on the basis of linear interpolation between observatory 
field values. It should be noted that, for more conventional magnetic 
disturbances arising from auroral electrojet current systems, the fall-off 
in magnetic disturbances from higher to lower latitudes is nonlinear ; 
i.e., the disturbance level falls off more rapidly at the lower latitudes. 
The changes in the location and intensity of the magnetic disturbances 
from the relatively undisturbed period at ~ 2238 UT until the large 
disturbance at Meanook at 2242 UT is quite evident in the contours 
of Figs. 3a and 3b. 

There is evidence in the contours of Figs. 3a and 3b of perhaps some 
progression of the geomagnetic disturbance from the higher to the 
lower latitudes. For example, in the interval 2240 to 2241 UT the 
disturbance change is largest at College, while in the interval 2241 to 
2242 UT the disturbance change becomes largest at Meanook. If this 
progression of the disturbance is associated with a distortion and 
compression of the magnetospheric boundary to smaller radii (and 
therefore lower latitudes), the contours suggest that the boundary 
compression may have reached field lines that intersect with latitudes 
as low as that of Meanook. That this was apparently the case is 
discussed below. 
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Plotted in Fig. 4 as contours (linear interpolations) on a geographical 
map of North America are the magnitudes of the total horizontal field 
changes and the angles [from the D(Y) direction] of the changes 
measured between 2241 and 2242 UT, at about the time of the sudden 
changes in the apparent ATS 5 location from the magnetosheath to 
the magnetosphere (see Fig. 1). At this time, the field intensity at 
Plano is interpolated to be } 700y and the field change is aligned in an 
approximately NI-SW direction. These are the important geo- 
magnetic parameters that will be used in the next section to calculate 
the expected induced earth currents at Plano. 

Before calculating the earth currents however, it is of interest to 
further examine the magnetospheric environment at 2241 UT. At 
about this time, the Explorer 45 satellite was located at its apogee 
position and was about two hours closer to local noon than ATS 5 
(see Fig. 1). Magnetic field data and charged-particle data from 
Explorer 45 indicate that, at 2242 UT, when the magnetosphere 
boundary expanded outward beyond ATS 5, a movement of the 
boundary inward, inside Explorer 45, was recorded. Inspection of 
the magnetograms from College, near local noon, at 2241 UT 
indicate that the field increased in magnitude and that the field 
changes were predominantly in the H direction (see Fig. 2). This 
observation also suggests a compression of the magnetosphere boundary 
near local noon. 

Hence, summarizing and synthesizing as well as possible the avail- 
able magnetospheric data, the boundary of the magnetosphere in the 
local afternoon sector at & 2240 and 2242 UT could be pictured 
as in Fig. 5. Not only is the boundary greatly compressed from the nor- 
mal location (see Fig. 1), but it is also highly distorted as evidenced 
by the simultaneous observations of the magnetopause at altitudes 
higher than ATS 5 but lower than Explorer 45 at 2242 UT. This 
observation by Explorer 45 provides evidence of the most compressed 
position of the magnetopause yet recorded. The extreme compression 
and distortion of the magnetosphere recorded in the sector above the 
western hemisphere at 2242 UT was undoubtedly the primary cause 
for the large magnetic field changes recorded in North America at 
the time. 


Il. TELLURIC DISTURBANCES 


To put into perspective the analysis used to calculate the induced 
earth currents at Plano, it is of interest to review the basic relation- 
ships between earth resistivity structure, geomagnetic disturbances, 
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Fig. 3a—Minute-by-minute rate of change of magnetic field in the horizontal 
plane over North America from 2238 to 2241 UT. 
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Fig. 5—Equatorial plane view of earth and magnetospheric boundary in after- 
noon sector at 2240 UT and 2242 UT. 


and telluric disturbances; i.e., induced electric fields at the earth’s 
surface. 

The basic theory of tellurics is contained in a boundary value 
problem involving Maxwell’s equations and the resultant electro- 
magnetic wave equation. An external exciting source is assumed. 
The phase and amplitude relationships between the orthogonal com- 
ponents of the horizontal electric and magnetic fields observed at the 
surface of the earth are measures of the electrical properties of the earth. 

Electromagnetic induction by a uniform horizontal magnetic field 
B [with components H(NS) and D(EW) ] in a uniform semi-infinite 
earth produces orthogonal horizontal electric fields, /, which satisfy 
the following relationships :° 


mod (H#/B) « 1/T? (la) 

arg (/B) = 7/4, (1b) 

where T' is the period of the magnetic field B. Hence, the ratio of the 
amplitude of the electric field to the magnetic field is proportional to 


the inverse square root of the period 7 and the phase difference is 
always 7/4. 
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It is necessary to measure three mutually orthogonal components 
in order to completely describe the vector magnetic field. It is custom- 
ary in land-based magnetic variation surveys at midlatitudes to 
measure the horizontal H (geomagnetic NS) varying, horizontal 
D (geomagnetic EW) varying, and vertical Z components of the 
magnetic field. The sense of the variations is positive north for H, 
positive east for D, and positive down for Z. The fields are referenced 
from the earth’s surface and magnetic north. The induction process, 
therefore, is such that a positive H variation will produce an east-to- 
west flowing telluric current (a negative surface electric field). [At 
high latitudes it is customary to use the geographic NS (X) and 
geographic EW (Y) components. | 

Telluric current observations, together with the magnetic field 
measurements, usually show that the amplitude ratio //B is propor- 
tional to the inverse square root of the period. Thus, the relationship 
in eq. (la) suggests the use of a uniform conductivity earth model. 
However, observations of the phase differences between / and B 
seldom agree with the predictions of the uniform earth model (1b). 
Phase differences as small as 20° for magnetic field variations with 
periods of 20 to 30 minutes and as large as 45° for much shorter 
periods have been reported.1® These large discrepancies in phase 
differences between the theory and the observations underscore the 
fact that a uniform earth model is seldom applicable to the real earth 
at an observing location. 

Wait!® has developed an analytical formulation of the electro- 
magnetic fields of an infinite line source above a horizontally stratified 
earth. The earth model can be extended to include any number of 
layers. The line source can serve as an equivalent current system to the 
real current systems (e.g., ionospheric or magnetospheric current 
systems), which can produce natural electromagnetic fields at the 
earth’s surface. The basic relationship between the electric and mag- 
netic fields at the surface of the earth is given as 


T Holby = “A, (2) 


where up is the free space permeability and Z: is defined as the surface 
impedance. lor a three-layer earth model, the surface impedance is 
given as!é 

Z, = beta (3) 


U1 : 
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where wu is the propagation constant for the first layer, u. is the free 
space permeability, w is the angular frequency, and Q is a correction 
term that accounts for the resistivities and thicknesses of the three 
layers. The mks system is used throughout. 

The surface impedance defines the relationships between the 
tangential electric and magnetic fields at the earth’s surface. The 
surface impedance also completely represents the electrical properties 
of the earth, when displacement currents are neglected. Estimates of 
the surface impedance can be made from knowledge of the geology 
in a specific area. 

The apparent resistivity of the three-layer earth model is related 
to the surface impedance by the expression" 


- aw) 
pa(w) = = —|Za(w)* (4) 


The apparent resistivity p. (in ohm-meters) is highly dependent on 
the angular frequency w of the source and the conductivity structure 
of the three-layer earth. It is important to note that many published 
values of earth resistivity are, in fact, apparent resistivities that are 
valid only in a limited frequency range. For example, the use of earth 
resistivity values determined at 60 Hz will not normally be valid at 
the frequencies at which telluric currents are important (see Fig. 7). 

The numerical evaluation of eq. (2) is substantially simplified if the 
electromagnetic fields are considered plane waves. Since no physical 
sources exist in nature that produce plane waves, it is necessary to 
determine the range of frequencies and geographic conditions in which 
the fields of a line source can be successfully approximated by mathe- 
matical plane waves. 

Peeples!’ has shown that the plane wave approximation will be valid 
for wave periods less than 500 seconds for earth models that have a 
highly conducting sedimentary first layer, if the line current is located 
at a height of at least 100 km. Morrison}® has shown that, as the resis- 
tivity of the first layer increases, the period range decreases for which 
a plane wave analysis is valid. The plane wave approximation has been 
found valid in the frequency range (0.01 to 0.1 Hz) in which telluric 
current effects are important to long-haul communication systems. 
Hence, the surface impedance and apparent resistivity for a plane 
wave can be readily computed from eqs. (3) and (4) once the param- 
eters of the three-layer earth are established. The surface electric 
field can be computed for any value of the magnetic induction field 
by inserting the appropriate surface impedance value into eq. (2). 
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IV. PLANO EARTH RESISTIVITY MODEL 


In this section, a three-layer earth resistivity model is developed. 
The model is derived from the modified Cantwell-McDonald earth 
resistivity model! and detailed information of the geology” in the 
vicinity of Plano, Illinois. This model is assumed to be representative 
of the entire L4 route from Plano, Illinois, to Cascade, Iowa. 

Geophysical studies in the vicinity of Plano indicate that the granitic 
basement rocks are at a depth of about 1.2 km. Above the basement 
lies a relatively thick sequence of flat-lying sedimentary rocks consist- 
ing of Cambrian and Ordovician sandstones, shales, and dolomites. 
Unconsolidated glacial deposits overlie the bedrock with thicknesses 
of 18 to 36 meters. 

The three-layer earth model in Fig. 6 was arrived at by merging 
the upper crustal model with the modified Cantwell-McDonald earth 
resistivity model. Because of the relatively low frequencies encountered 
in telluric studies, it was felt that the layer of unconsolidated glacial 
deposits could be ignored without affecting the results. 

The top layer of sedimentary rocks was given a resistivity of 100 
Qm and a thickness of 1.22 km; the second layer of granitic basement 
rock was given a resistivity of 5000 Qm and a thickness of 300 km; 
the bottom layer, representing the upper mantle, was given a resis- 
tivity of 10 Qm and an infinite thickness. The surface impedance was 
computed using the earth parameters from Tig. 6 in eq. (8) after 
modification of the latter for the plane wave approximation. 

The more familiar apparent resistivity, which is related to the surface 
impedance through eq. (4), was determined to show the frequency 


AIR 


N 
EARTH Vhy = 1.22 km pq = 100 2m 


| 
| 
| 
| 
| 
| 
J hz = 300 km — py = 8000 2m 
| 
| 
| 
| 
| 


h3 = © p3 = 10 Qm 
Fig. 6—Three-layer resistivity model for Plano, Illinois, vicinity. 
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response of the earth model. Figure 7 illustrates the variation in 
apparent resistivity as a function of frequency. The decrease in the 
apparent resistivity at the higher frequencies arises from the influence 
of the top layer. A maximum apparent resistivity is shown at about 
10-? Hz; this reflects the influence of the highly resistive second layer, 
the basement rock. The drop in apparent resistivity below 10-7? Hz 
arises from the influence of the less resistive upper mantle as the 
frequency decreases and the depth of penetration of the magnetic 
field increases. Hence, in this layered earth model, there are certain 
frequencies for which the apparent resistivities are much higher than 
others. Earth resistivity models in general are nonunique, and extrapo- 
lations from them should be done with caution. 

The surface impedance based on the above earth model was used 
to compute the variation in surface electric field as a function of the 
source frequency. Figure 7 shows the variation in surface electric field 
when the orthogonal horizontal magnetic induction field is 700 gammas. 
The induced electric field is highly dependent on frequency. The 
flattening of the induced electric field at about 10-2 Hz corresponds 
to the maximum in the apparent resistivity at this frequency. A 
uniform earth model would not show this flattening, but only a con- 
tinuous decrease in electric field magnitude with decreasing frequency. 
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V. THE L4 SYSTEM AND EARTH POTENTIAL OUTAGES 


The L4 system consists of coaxial cables powered in pairs by power- 
feed stations. Normally, the maximum distance between power-feed 
stations is 242 km (150 miles), with a nominal repeater spacing of 
2 miles. The power system is grounded at one end, and the output 
voltages of the four de-to-de converters (each rated to deliver 1800 V 
and a nominal line current of 520 mA) are balanced so that the voltage 
to ground at the ‘floating ground” end is zero. Figure 8 shows a 
typical L4 system powering section in the presence of an earth po- 
tential. In the presence of slowly varying voltages, the floating ground 
has a threshold of 370 V, above which it becomes automatically 
grounded. The automatic grounding feature serves as protection to 
the system. The floating ground must be restored to its normal condi- 
tion manually. 

Direct-current earth potentials produce changes in the L4 system 
line current that can cause transmission impairment and/or converter 
shutdown. The line current is unaffected by earth potentials until the 
370 V threshold is exceeded. After grounding occurs at the floating 
point end, the earth potential appears in series with the metallic 
power-feed loops of both lines. The earth potential will increase the 
voltage on the line of the same polarity, causing an increase in that line 
current. The potential will decrease the voltage on the line with the 
opposing polarity, causing a decrease in that line current. Since earth 
potentials produced by geomagnetic variations frequently reverse 
polarity on rather short time scales, both lines will experience high 
and low currents. The magnitudes of the line currents are dependent 
on the following factors: 


(t) Magnitudes of the earth potentials. 
(17) Degree of balance of the power system. 
(417) Dynamic resistance of the line and the converters. 


System designers have analyzed an ideal model of the L4 system 
to estimate the line current variations as a function of earth potential. 
The results of this analysis with respect to converter shutdown on a 
standard 242-km power section are :7!:?2 


(t) At 6.5 V/km, the line with the aiding earth potential will 
experience a high current shutdown. 

(72) At 8.9 V/km, the line with the opposing earth potential will 
experience a low current shutdown of both converters feeding 
the line (see Fig. 8). 
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Fig. 8—L4 power-feed section in the presence of an earth potential. 


The high current shutdown at 6.5 V/km is considered the most 
serious effect of earth potentials and, as is discussed below, is most 
likely the cause of the Plano-Cascade L4 shutdown of 4 August 1972. 

The surface electric field induced along the Bell System L4 route 
from Plano to Cascade at approximately 2242 UT, 4 August 1972, 
can be determined from eq. (2). The magnetic field variation | AB/At| 
can be determined from Iig. 4 and the surface impedance can be 
determined from eq. (3) using the Plano earth resistivity model. The 
estimated geomagnetic disturbance of = 700 y/min (see Tig. 4) will 
induce a perpendicular surface electric field of 47.4 V/km (see 
Fig. 7). 

The estimated direction of the geomagnetic disturbance vector was 
about 40° north of east (see Fig. 4), making an angle of approximately 
70° with the Plano L4 route. For this angle, about 94 percent of the in- 
duced electric field, i.e., & 7 V/km, would directly affect the L4 power- 
feed section. Bell Laboratories engineers have established a working 
shutdown value range for earth potentials of 6.5 V/km +20 percent.” 
Comparison of this range to the Y 7 V/km value deduced for 4 August 
from magnetic field fluctuations shows that the surface electric fields 
that were most probably induced along the Plano to Cascade route 
at & 2242 UT 4 August 1972 were probably sufficient to cause the L4 
shutdown that took place. 


The Uniqueness of the Plano L4 Outage 


A complete explanation of why the Plano-Cascade section of the 
transcontinental L4 route experienced a shutdown and why other 
sections did not is not available at the present time. To establish some 


L4 AND GEOMAGNETIC DISTURBANCES 1833 


argument for uniqueness, it is worthwhile to outline some factors that 
might contribute to a telluric current shutdown. The major factors 
can be divided into geophysical conditions and system susceptibility. 

Previous sections of this paper have dealt with the importance of the 
rate of change of the inducing field |AB/At| and of the surface im- 
pedance in determining the induced telluric currents. To achieve a 
maximum telluric current or earth surface potential along an L4 
route, the horizontal magnetic field variation must be perpendicular to 
the direction of the route. 

The contour maps shown in Fig. 4 illustrating the area distribution 
of |AB/At| and its direction angle in the interval 2241 to 2242 UT 
across North America are a valuable tool for determining the magni- 
tude of the geomagnetic disturbances that actually affect an L4 
power-feed section. As mentioned above, an estimated 94 percent of 
|AB/At| at 2242 UT affected the Plano-to-Cascade power-feed 
section. The L4 power sections immediately to the east of Plano and 
immediately to the west of Cascade both have a more east-west 
direction than the Plano-to-Cascade section. This means that only an 
estimated 60 percent of |AB/At|, producing an electric field of 4.7 
V/km, will affect these sections. This estimated electric field value is 
below the lower-limit system shutdown value of 5.2 V/km. These 
comments must be qualified by the fact that, as noted earlier, the 
available magnetometer data were certainly not optimum to determine 
field changes and directions with high accuracy. 

The frequency response of the earth’s surface impedance along the 
cable route is also a very important factor in producing maximum 
telluric currents. As shown in Fig. 7, the frequency response of the 
Plano earth model was a maximum at 10 Hz, which is the fre- 
quency of the largest magnetic variations at 2242 UT. Analytical 
results from various earth resistivity models have shown the frequency 
response of the surface impedance to be fairly sensitive to changes 
in layer thickness. If the earth resistivity structure profile were 
known along the entire transcontinental L4 route, it would aid in the 
determination of which sections of the route are most susceptible to 
telluric currents. This information is not presently available, and 
probably will not be for quite some time. Thus, this discussion indi- 
cates that a combination of geophysical factors is needed to produce 
maximum telluric currents along a specified cable route. All these 
factors are very difficult to estimate at any particular time from the 
distribution of geomagnetic and telluric observatories presently 
existing in North America. 
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From the point of view of system susceptibility, each power-feed 
section of the L4 route will most likely have a different earth potential 
shutdown value. This condition exists because of variations in the 
power system balance, because of variations of dynamic resistance in 
the individual lines and convertors, and because of the length of any 
particular power-feed section. For instance, the Plano-to-Cascade L4 
power-feed section, at = 248 km, is the longest in the Bell System. 
The L4 sections immediately to the east and to the west are 213 
and = 230 km long, respectively. The decreased lengths in these 
sections increase the critical shutdown voltages to % 7.4 and 6.8 
V/km, respectively. 


VI. SUMMARY 


The preceding discussions, using most of the existing North American 
data available for the time period around the L4 system outage of 
4 August 1972, have indicated that the geomagnetic disturbances 
were apparently sufficient to produce the outage on the Plano, Illinois, 
to Cascade, Iowa, route. The geomagnetic disturbances were produced 
primarily by large distortions of the earth’s magnetosphere, whose 
boundary was observed to be pushed inward to an altitude of about 
four to five earth radii over North America at the time of the L4 
outage. Therefore, the geomagnetic disturbances that produced the 
outage did not arise from enhanced auroral current systems that are 
normally associated with power system problems during magnetic 
storms. 

It is difficult to establish the uniqueness of the L4 problem along 
the Plano-Cascade route during this large storm. A large part of this 
problem arises because of the insufficiency of the geophysical data, 
data concerning the magnitudes and spatial distributions of the 
magnetic fields and earth currents during the magnetic storm as well 
as data on the geological structure under the L4 route. 

In addition to problems of coaxial system outages from large 
geomagnetic disturbances, it is likely that smaller magnetic storms 
may occasionally induce sufficient currents on some routes in certain 
locales to produce transmission impairments. Experimental work to 
study this problem is presently under way. 
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New Frontiers of Varactor Harmonic Power 
Generation in the C-Band 


By S. V. AHAMED and J. C. IRVIN 
(Manuscript received June 24, 1974) 


Gallium-arsenide ‘“‘stacked’’ varactors employed in a frequency doubler 
and tripler have given 8- to 10-watt output at 4 and 6 GHz with efficiencies 
of 70 to 80 percent. 


1. INTRODUCTION 


We have designed and developed a new class of varactors that have 
been noteworthy beneficiaries of the techniques evolved for GaAs 
IMPATTs. In this paper we report the expcrimental results obtained 
from double-stacked diodes for frequency doubling (2 to 4 GHz), and 
double- and triple-stacked diodes for frequency tripling (2 to 6 GHz). 


Il. VARACTOR CONSTRUCTION 


The power and efficiency sought in the present application (8- to 
10-W output at 4 and 6 GHz with a minimum efficiency of 70 percent) 
dictate a low-loss, high-voltage varactor. These conflicting goals are 
best achieved by a series connection of two or more chips in a single 
package.! Using a Schottky-barrier GaAs chip, the power dissipation 
per chip is so small that a simple series stacking of one chip upon 
another is adequate, without heat sinks for any chip except the bottom 
one. This scheme is facilitated by using dice of typical C-band I MMPATT 
design,” i.e., squat cylinders or truncated cones (0.2 mm in diameter 
by 0.08 mm tall, for example) in which the active area is that of the 
cylinder itself. The dice are thermocompression-bounded in “‘flip-chip”’ 
position, one on another, in a package with a diamond heat sink. The 
series combination not only increases power-handling capacity con- 
siderably, but it also alters the input and output impedances favorably. 

The chips employed here were originally tested as IMPATTs, 
where they gave about 3 W at 10 to 12 percent efficiency in C-band. 
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As a double stack, these chips formed a varactor with typical values 
of 7.5-pF zero-bias capacitance, 174-V breakdown, and 0.65-ohm 
zero-bias resistance. In a triple stack, the varactor’s values were 
typically 5.0 pF, 265 V, and 0.98 ohm. The cutoff frequencies at 
breakdown are 300 GHz or greater. 


Ili. THE 2- TO 4-GHz FREQUENCY DOUBLER 


A coaxial embodiment of the doubler (see Fig. 1) has been chosen 
to yield experimental flexibility. The diode is mounted at the center of 
a 30-cm, 50-ohm coaxial airline. A 4-GHz reentrant choke to block the 
output frequency from the input and a 2-GHz quarter-wave-long 
transformer to match the impedance constitute the input side. A 
quarter-wave 4-GHz transformer and a twin capacitor 2-GHz filter 
make the output side. The axis of the inner conductor is positioned at 
the axis of the outer conductor by radial pressure between the diode 
and a long, thin, spring-loaded polystyrene pin. The frequency doubling 
by the diode has been simulated on a digital computer, and the effi- 
ciencies and bandwidth data are obtained together with the impedance 
for best results. The correlation between the simulated results and 
experimental data is shown in Fig. 2. The half-dB bandwidth at 4 GHz 
is approximately 300 MHz. 


IV. THE 2.115- TO 6.345-GHz TRIPLER 


A coaxial embodiment with three limbs in a T configuration (see 
Fig. 3) for the input, output, and idler circuits has been designed and 
constructed for the tripler circuit. The diode is mounted along the axis 
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Fig. 1—Cross section of the 1995- to 3990-MHz doubler. 
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Fig. 2—Efficiency characteristics of the doubler, double stack 7.5 pF, 174V, 
0.65 ohm. 


of the T opposite the idler stub with the remaining two limbs for input 
and output sides. A 2.115-GHz quarter-wave transformer, a 4.23-GHz 
(idler frequency) choke, and a 6.345-GHz choke suitably located with 
respect to one another and the diode constitute the input side. A 
2.115-GHz choke, a 6.345-GHz transformer, and twin-capacitor 4-GHz 
filter (also serving as a de block) constitute the output. The idler 
circuit consists of a 0.53-pF capacitor (which also blocks the de voltage 
because of average charge on the diode), a suitable length of the inner 
conductor, and a sliding short between the inside and outside conduc- 
tors. The axis of the central conductor between the input and output 
sides is aligned with the axis of the outer conductor by opposing radial 
pressures between the stub and the diode. The positions of the various 
components in the tripler that can be computed agree well with the 
experimentally determined locations. Fine tuning is accomplished by 
four pin tuners, two on the idler stub and one each on the input and 
output limbs. The correlation between the simulated results obtained 
from a digital computer and the experimental data is shown in F'ig. 4. 
The bandwidth that has been computed to be about 70 MHz at 
6.345 GHz is experimentally measured at 0.5-dB points as 61.2 MHz 
at 8 W, 50.4 AI Hz at 7 W, 58.8 \[Hz at 6 W, and 66 \IHz at 5 W fora 
triple-stacked varactor. This varactor is extremely resilient to the 
extent that no damage occurs by gross mistuning at 10 W. The double- 
stacked varactor has yielded 10 W but has been occasionally damaged 
while tuning, probably because of excessive avalanching. 
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Vv. CONCLUSIONS 


Both high power and high efficiency that approach the theoretical 
limit for GaAs diodes of this design may be simultaneously achieved 
by proper design of the varactors. The good control in doping density, 
the use of nonohmic back contacts, and a diamond-based heat sink 
that permits a tolerable temperature rise of the most thermally isolated 
chip in a stacked varactor, make these new varactors good contenders 
for microwave power by frequency multiplication. 
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This paper describes the general properties of a negative-impedance- 
boosted (NIB) transmission line, an otherwise uniform line to which 
lumps of negative impedance have been introduced periodically along its 
length. The purpose of these lumps is to reduce the attenuation of the line, 
but they also make possible self-oscillation and cause reflections. However, 
these reflections are cancelled up to a cutoff frequency that has an inverse 
relationship to the spacing of NIB units. Summaries are included of the 
latest NIB project; of previous investigations into NIB lines, many of 
which were not published; and of the first work in the Bell System to 
reduce transmission-line attenuation through the introduction of loading 
coils and then unilateral amplifiers. It is shown that many of the differ- 
ences between properties of NIB lines and properties of a line fitted with 
untlateral amplifiers come from the different ways the source of energy 
replenishment 1s coupled into the line. 

It is shown that, under suitable conditions, the addition of small simple 
NIB units to a transmission line can reduce its loss to a low level in a 
stable manner and also equalize it. Because of some properties of NIB 
lines, they cannot be used successfully in all situations; in particular, an 
environment in which large longitudinal currents are induced in the lines 
zs unfavorable. 


I. INTRODUCTION 


The use of negative impedances to reduce the attenuation of tele- 
phone transmission lines is the subject of this paper. The principal 
material is in Section V and deals with the general properties of trans- 
mission lines to which negative impedances have been added periodi- 
cally. These properties are compared with those of lines with loading 

1845 


coils and lines having conventional unilateral amplifiers. This use of 
negative impedances has been investigated a number of times in 
several ways over the past forty years, but very little of this work has 
been published. A brief history of this effort is given in Section VI. 
A short section on the properties of negative-resistance elements is 
also included. 

For perspective in viewing the use of negative impedances in trans- 
mission lines, a few highlights are provided from the early development 
of means to reduce attenuation in the lines of the Bell System. This 
development from early Bell System work makes a very interesting 
story in itself, parts of which may be found in Refs. 1 through 8— 
the account in Ref. 1 being especially good. However, this background 
information is given here not only for perspective, but because the 
introduction first of loading coils and then of gain by means of uni- 
lateral amplifiers furnishes two important points of reference in 
discussing the properties of attenuation reduction by means of negative 
impedance. 

The most recent study of the use of negative impedances in trans- 
mission lines, begun several years ago by L. A. Meacham, is sum- 
marized briefly in Section VII. 


ll, LOSS REDUCTION BY LOADING COILS AND AMPLIFIERS 


The first major advance in pushing back the barrier of distance in 
telephone transmission came around 1900 with the introduction of 
loading coils. Through the work of Heaviside and others, it had been 
known for some years that if the series inductance of a line could be 
increased, its attenuation would be decreased. But there was no satis- 
factory way to increase continuously a line’s inductance, although this 
was done to a limited extent in submarine cables later on. Soon after 
joining the AT&T Laboratories in Boston, Dr. George A. Campbell 
in 1899 developed a theory of loading for adding inductance in finite 
lumps to a line.’ In an unusual coincidence, a similar theory was worked 
out by Professor Michael Pupin of Columbia University at about the 
same time. In a patent-interference case, it was decided that Professor 
Pupin preceded Dr. Campbell by a few days. Campbell’s analysis, 
which was more detailed than Pupin’s, gave the relation between kind 

of line, size of coils, and their spacing and cutoff frequency. These 
relations were used to begin the introduction of the sOnee process 
and are still widely used today. 

Campbell’s invention of the wave filter came from the results of his 
analysis of the loading process, which was seen to make a low-pass 
filter of the line. The making of suitable coils was quite a problem 
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itself in those days, but within about a year, installations of coils were 
beginning and attenuations were being reduced to about one-half 
their former values. This was a great advance and made possible a 
large expansion of the telephone network in distance and use. The 
theoretical cutoff frequency of these loaded lines was about 2300 Hz, 
but because of the core material in the coils, the actual line losses 
began to increase considerably before this cutoff. With later improve- 
ments in coils and technique, line losses were often reduced to one- 
third or one-fourth the nonloaded values.? By 1905, with the use of 
these coils and #8 or #10 wire, the long-distance open-wire network 
in the East had been extended as far west as St. Louis and Kansas 
City without benefit of repeaters and by 1911, Denver had been 
reached. 

An important goal at the time was to link the east and west coasts 
by telephone. To do this, either #5 wire (0.18-inch diameter) had to 
be used in the line from Denver to San Francisco, or a suitable form 
of repeater had to be found. The repeater path appeared to be the 
more economical and so the work already under way on repeaters and 
their use in transmission lines was expanded. 

In fact, between the invention of the telephone and the appearance 
of the high-vacuum-tube amplifier in 1913, there was a large amount 
of effort expended, both inside and outside the Bell System, to devise 
a workable amplifier for telephone signals. During this time, many 
inventions were submitted to the AT&T Company for possible use 
as telephone repeaters. Some of these involved rotating electrical 
machinery or other ponderous mechanical devices that were not 
suited to high-frequency telephone signals. Others, on analysis, 
turned out to be simply transformers with no energy source.* An 
amplifier is really a modulator in which an incoming signal, with a 
small expenditure of energy, modulates a larger local energy source 
to produce a gain as it repeats the signal. In a vacuum-tube amplifier, 
the de plate current supplied by a battery is modulated by the grid 
voltage. In a transistor amplifier, the collector current is, in effect, 
modulated by the much smaller base current. In a carbon-button 
amplifier, the varying resistance of the carbon granules in response 
to the motions of the receiver armature modulates the de flowing 
through the button with a power gain of several hundred. In a para- 
metric amplifier, the source of energy is a local oscillator supplying 
pump current which is modulated by the signal. And so on. 

During the time when a suitable repeater was most actively sought, 
only two devices showed enough promise to be seriously considered. 
One of these was a combination of carbon-button transmitter and 
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receiver which, in its best version, had the transmitter and receiver 
coupled mechanically instead of acoustically. A few commercial 
installations of this device were made although it had a number of 
problems, one of which was variability of gain. The other device was a 
gaseous discharge that fundamentally had the characteristics of a 
negative resistance rather than a unilateral amplifier. 

At this point, the three-electrode vacuum tube appeared on the 
scene and showed such great promise that the first two devices were 
soon put aside. The first high-vacuum-type repeater was installed 
in late 1913, a little less than a year after H. D. Arnold had been 
shown DeForest’s three-electrode tube. By the middle of 1914, three 
improved repeaters had been successfully installed and operated in 
the new transcontinental line. The net loss of this line was 20 dB and 
the 10-dB bandwidth was from 350 Hz to 1250 Hz.1+4 


Ill. PROBLEMS AND PROPERTIES OF REPEATERED LINES 
USING UNILATERAL AMPLIFIERS 

The introduction of repeaters into the telephone plant showed up 
transmission problems that had been obscured before. At any point 
in the line where its uniformity is disturbed (called an irregularity), 
part of a wave traveling down the line will be reflected back toward 
the wave’s source. Such a reflected wave is like an echo. When the net 
loss of the line is 20 dB or more, the returning echoes are weak and 
hardly noticeable. But when gain was added to such a circuit, the 
echoes could be very objectionable. In fact, they could limit the 
amount of usable gain. lor example, the net loss in the first trans- 
continental line in 1914 was limited to 20 dB. 

Another factor in the effect of echoes is the transmission time of the 
line. Because of the inductive loading, the transmission time of the 
transcontinental line was 70 ms, which is high enough so that if 
strong echoes occur they will be objectionable. As understanding of 
these effects grew, the loading was removed, reducing the transmission 
time to 20 ms. With this change, the loss increased about 100 dB but 
more gain could be added, reducing the net loss to 12 dB. An extra 
benefit that came from the removal of the loading was higher-quality 
speech transmission because of the increased bandwidth.® Since the 
attenuation vs frequency characteristic of open-wire lines is fairly 
flat, the use of loading restricted the bandwidth. Gradually all loading 
coils were removed from open-wire lines. The situation is different 
with cable pairs, which have a considerably higher capacitance. Here 
the coil loading tended to flatten the attenuation vs frequency curves 
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so that only a little equalization was required at the repeaters. Hence, 
loading has generally been kept in voice-frequency cable circuits, 
although the amount of loading has been reduced in long circuits to 
reduce transmission time. 

Another effect of the reflections when repeaters are in the line is the 
possibility of free oscillation in the line, or ‘‘singing.’”’ If an amplifier is 
inserted into a perfectly uniform transmission line in such a way that 
it does not cause reflections, the gain of the amplifier may be as large 
as desired. But if there are reflections, the amount of gain that can be 
inserted will be very definitely limited by either echoes or sing- 
ing.» 8" Reflections can be caused by an improper termination of 
the line, by a missing or misplaced loading coil, or by the way amplifiers 
are coupled into the line. 

To obtain bilateral transmission with a unilateral amplifier, a 
special kind of circuit is required to couple the amplifier into the line. 
Some form of bridge circuit is the usual means—a Varley loop was 
first used for telegraph repeaters. Coil arrangements for doing this 
are called hybrid coils, one of which plus one amplifier make a 21-type 
repeater that can amplify in both directions. But its losses are high 
and it sends amplified signals in both directions with the effect of an 
echo even though the lines in both directions are well-balanced against 
each other. Hence, they were not used very much. The 22-type repeater, 
shown in Fig. 1, uses two hybrid coils, two amplifiers, and two line- 
balancing networks, and gives much better performance. Losses are 
minimal and reflections come only from imperfections in balance 
between the line in each direction and its balancing network. This 
means for coupling amplifiers into a line was invented in 1895 (U. S. 
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Fig. 1—Diagram for connecting unilateral amplifiers and hybrid coils to make a 
22-type repeater. 
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Patent 542,657) by a Bell System engineer, W. L. Richards, long 
before there were good amplifiers with which to use it.4 

After the successful introduction of the 22-type repeater into the 
system, the AT&T Company continued to have submitted to it 
inventions that purported to reduce in varying degrees the complexity 
of the means for coupling unilateral amplifiers into the line. They were 
mainly variations of the basic bridge circuit or were completely 
unworkable. No satisfactory substitute for the hybrid coil arrange- 
ment of Fig. 1 has appeared except for the very different approach 
to overcoming line loss by the introduction of negative resistances, 
which will be discussed below. 

As may be seen from Fig. 1, the amount of gain that can be obtained 
from this repeater configuration before local singing occurs depends 
on the loss across the hybrids (vertically in the diagram) and is higher 
for higher losses. This loss is infinite if the network exactly balances its 
corresponding line and is zero if the line presents a short or open 
impedance. Thus, to operate these repeaters with considerable gain 
in two-wire lines requires a fairly high degree of balance between 
networks and lines. When a number of repeaters is used in tandem 
in a line, additional possible singing paths are introduced as well as 
additional echo sources at each departure from uniformity, which 
from a practical viewpoint is at each repeater. To eliminate these 
multiple singing paths and echo sources, four-wire transmission was 
introduced in many toll circuits and is universally used now in all 
carrier circuits. In this arrangement only two hybrid coils are used, 
one at each terminal. The upper path in Fig. 1 becomes a two-wire 
line with a sufficient number of east-west amplifiers approximately uni- 
formly spaced. The lower path is similar but transmits only west to 
east. The singing problem is greatly reduced with this arrangement 
because both horizontal transmission paths between the terminal 
hybrids have very little if any gain since each includes the line loss 
as well as the amplifier gain. In some respects, the four-wire configura- 
tion of repeaters is similar to the use of series negative-impedance 
boosters, as will be seen below. 

If the two wires of the pairs in the two-wire and the four-wire 
circuits, and the repeater circuits connected to them, are well-balanced 
against each other, interference currents will largely be drained off to 
ground away from the repeaters and terminals. 

The 22-type repeaters were economically as well as technically 
successful in toll transmission, but were deemed too expensive and 
complicated for exchange trunks where gain was needed in the general 
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upgrading of telephone quality in the nineteen twenties. Partly because 
of continuing study of repeatered systems and partly because of the 
need for a cheaper repeater, as indicated above, different ways of 
reducing line attenuation were sought. 

One way, quite different from the use of unilateral amplifiers de- 
scribed above, and which appeared to hold considerable promise, was 
that of introducing negative resistances into the line to cancel, or 
partly so, the positive line resistance, which, because of good dielectrics, 
is the principal source of loss in lines. While some thought had been 
given to such use of negative resistances at about the time the vacuum- 
tube repeater appeared, as indicated above, a considerable effort in 
this direction began a little before 1930 and has continued off and 
on until the present. A review of this work is given below in Section VI. 

Now, with our summary of the processes by which gain may be 
obtained, we are able to see a certain equivalence between an amplifier 
and a negative resistance. Yet the difference in the ways they must be 
used in a transmission line, or in the particular form one or the other 
takes, may make it more desirable to introduce the gain in one of the 
ways instead of in the other. 

The next section will describe some of the properties of negative 
resistances and devices that exhibit this characteristic. 


IV. NEGATIVE RESISTANCES 


A positive resistance absorbs energy from connected circuits and 
dissipates it. A negative resistance delivers energy to connected 
circuits and so must have within it a source of energy on which it may 
draw. More precisely, a negative resistance as a device is able to 
convert the energy of a local source, to which it is connected, into a 
form suitable for passing on to other circuits. In other words, it must 
be able to modulate the energy from the local source as an amplifier 
does. An amplifier, though, is usually a three-terminal device in which 
the energy stream is controlled externally. A negative resistance, on 
the other hand, is a two-terminal device utilizing some internal phe- 
nomenon that depends on the voltage across it or the current through 
it to control the energy stream. An early paper by Crisson! describes 
two kinds of negative resistance. He shows also in this paper that either 
form of negative resistance can be generated by connecting the output 
of an amplifier to its input either in a series way or a shunt way. This 
suggests that in general a negative resistance is the result of some kind 
of feedback process, that is, one in which current or voltage depends 
on itself in some way, either externally or internally. A good general 
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paper on negative resistance and devices which exhibit it is the one by 
EK. W. Herold.# ‘ 

Some parasitic reactance may be associated with a negative- 
resistance device and/or some may be intentionally added to it so 
that, in general, an impedance with a negative resistance component 
is what is used. The general term negative impedance is applied to 
such a device. Because of the reactance, the impedance usually does 
not have a negative resistance component above a certain frequency. 
It is possible also for negative impedances to be generated in a largely 
reactive device, such as a varactor, by an internal modulation process. 
Here the energy source is not dc but the external pump oscillator. 

When a negative resistance is generated by connecting the output 
of an amplifier to its input, the voltage-current characteristic is a line 
with negative slope extending from the origin to where the amplifier 
begins to overload. Here, in effect, the true energy source is concealed 
within the device. However, in many negative-resistance devices, the 
energy comes to the device from the source through a direct current 
that also flows through the two terminals of the device. Over a certain 
range of values of this current, the device exhibits a negative resistance 
and so is able to convert energy from the source to a form suitable for 
use in the connected circuits. All known negative resistance devices 
of this type have either one or the other of two shapes of voltage- 
current characteristic, as illustrated in Fig. 2. The series type is shown 
in Fig. 2a, and the shunt type in Fig. 2b. Between points A and B, 
the slope dV /dI is negative and, for current variations in this range, 
the device presents a negative resistance to external circuits. The 
magnitude of the negative resistance is this slope expressed in ohms. 

To get an understanding of the effect of a negative resistance in a 
circuit, consider the diagrams of Fig. 2c, where a positive resistance R, 
has been added to the negative resistance R and the resulting overall 
voltage-current curve drawn. When the added positive resistance just 
balances the negative component, the V-I curve of the combination 
has a slope of zero between A and B (Fig. 2c). That is, if, as indicated 
in the figure, a varying signal current is superimposed on the bias of 
direct current supplied by the energy source, the voltage drop across 
the combination of positive resistance and negative resistance device — 
is zero as long as the variations remain within the range A to B. This 
means that all the positive signal energy dissipated by the positive 
resistance R, comes from the negative resistance R and none from the 
signal generator e. This is an indication of how the negative resistance 
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Fig. 2—Voltage-current curves for negative resistance. 


supplies energy to the connected circuits by appearing to cancel or 
nullify the positive resistance, or a part of it. 

Of course the borderline case just discussed could not really exist 
because the signal generator e, being zero, is not in control. To under- 
stand this incipient instability, let R, be less than RF so that the slope 
of the V-I curve between A and B is now negative as in Fig. 2a, and 
consider also the small parasitic inductance / which would be in series 
with the resistances. Then the signal current 7 would be given by the 
expression 

a=e/(sl1—R+R,), 


where s is complex frequency o + jw. If the magnitude of FR is greater 
than that of R., the impedance of the circuit has a root in which s 
has a positive real part and so is unstable. If, in addition, there is a 
resonance of reactances associated with the circuit, sustained oscil- 
lation will occur at the frequency of resonance. Thus, there is an 
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instability barrier that prevents the negative resistance from supplying - 
any more energy to connected circuits than just enough to cancel the 
losses in the external positive resistance. 

Because of the instability conditions just described, the V-I charac- 
teristics of Fig. 2 cannot be obtained directly from measurements on 
the device alone. If the voltage in Fig. 2a or the current in Fig. 2b is 
increased from 0 until point A is reached, the current in Fig. 2a or the 
voltage in Fig. 2b will appear to jump to point C, skipping over the 
intermediate part of the characteristic. Actually this is a very fast 
transient with a positive real exponent as indicated above. This 
instability will not occur if a positive resistance a little larger than the 
magnitude of negative resistance is connected in series with the series 
device, nor will it occur if a positive conductance a little larger than 
the negative conductance is shunted across the shunt device. Hence, 
the names, series and shunt, are given to the two types of negative 
resistance. The characteristics, Figs. 2a and 2b, are deduced by mea- 
surements of the combination circuit, as shown in Fig. 2c. The slope of 
the negative-resistance region can be measured quite accurately in 
such a circuit by adjusting R, to obtain a null of signal voltages across 
the series combination of R and R,. 

One of the earliest negative-resistance devices is the carbon arc 
(or other gaseous discharge). This is the series type and is generated 
when the ionization in the discharge begins to increase by means of 
the current of the discharge itself. The avalanche process in semi- 
conductors is the modern counterpart of this. The dynatron invented 
by A. W. Hull'‘is of the shunt type and depends on secondary emission, 
which becomes appreciable when a voltage reaches a certain value. 
Other devices that can exhibit negative resistances are thermistors, 
avalanche diodes and transistors, tunnel diodes, IMPATT diodes,'® 
and Gunn-effect diodes.1® The paper on IMPATT diodes by K. D. 
Smith is a particularly good description of this device and of the 
general properties of such negative resistances. As the physical dimen- 
sions are reduced, the parasitic reactances are reduced and the device 
can show a negative resistance at higher frequencies. The IMPATT 
and Gunn-effect diodes differ from the others in that they show a 
negative resistance only in a narrow frequency band, depending on 
their sizes. 

Other sources of negative resistance are made by combining several 
devices such as two vacuum tubes or two transistors and, in this form, 
they are often called negative-impedance converters.!7 Vacuum tubes 
were used in the first E-type negative-impedance repeaters and 
transistors in the latest ones and also in the work described below. 
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When negative resistances are made in this way, the opportunity 
exists of modifying the variations of the negative resistance with 
frequency and/or current by adding passive circuit elements to the 
configuration in order to serve particular needs. They also can be 
made in integrated-circuit form for small size. 


V. GENERAL PROPERTIES OF TRANSMISSION LINES WITH 

NEGATIVE-RESISTANCE LUMPS ADDED PERIODICALLY 

It was shown in Section IV how a negative-resistance device supplies 
energy at signal frequencies to circuits in which it is connected, acting 
as if it cancelled the effect of positive resistance in the same circuit. 
Hence, it is natural to think of this as a way to reduce the attenuation 
of a transmission line. This is not a new idea. It was mentioned as a 
possibility in the 1919 repeater paper* referred to above. The term 
booster was applied to negative-impedance devices used in this way 
in that paper and also in another.” This name was revived by Meacham 
in his negative-impedance work. In quite a bit of the literature, the 
process of adding negative impedances to a transmission line is called 
negative-impedance loading, drawing a parallel to the familiar practice 
of coil loading. The term boosting seems to the author to be a better 
description of the process and will be used here. 

This section deals with some of the general properties of negative- 
impedance-boosted (NIB) lines. This will be done without reference 
to any particular form of negative impedance, as far as is possible. 
However, to discuss the particular shapes of the resulting transmission 
curves or the particular details of stability, it is necessary to completely 
specify the properties of the negative impedance used. It is a fact also 
that whether or not the general idea of boosting is attractive may 
‘depend on the existence of a booster with certain properties. Thus, 
while the possibility of using boosters to overcome attenuation was 
recognized in the Jewett paper* of 1919, as mentioned above, no very 
suitable devices were available then. 


5.1 Coupling negative impedance into the line 


The simplest method of utilizing negative-impedance boosters 
(NIB) in a line is to connect the series-type units in series with both 
wires of the line and uniformly spaced along it, as shown in Fig. 3. 
In this form, the boosters are simply two terminal devices through 
which the line currents flow. A direct current, which flows through 
boosters and line along with signal currents, supplies the necessary 
energy to the boosters. The unit may consist of a single special element 
or a combination of several circuit elements. Experimental versions of 


NEGATIVE-IMPEDANCE UNITS 1855 


NEGATIVE — IMPEDANCE — 
BOOSTER UNITS | 


Fig. 3—A NIB line made by the periodic series insertion of negative-impedance 
units into a transmission line. 


the latter arrangement have been made very small in size using hybrid 
integrated-circuit techniques. In the E-type negative-impedance 
repeaters,’ ° transformers are used to couple the boosters into the 
line, and these repeaters are used only at central offices. Thus, the 
manner of using negative-impedance boosters is quite different from 
the manner of using unilateral amplifiers. 

While most of the descriptions in this paper are in terms of boosters 
added in series with the line, reduction of attenuation may be obtained 
also by adding suitable negative resistances in shunt to the line. Or a 
combined series and shunt connection may be used. In the latter 
circumstance, the combination booster may be made to present to the 
line an impedance that approximates the characteristic impedance 
of the line. This has been done in some of the E-type negative-imped- 
ance repeaters?! discussed in Section 6.3. Reflections in this case are 
small and the behavior of the line in this respect is more like one 
equipped with conventional repeaters. 


5.2 Bilateral transmission 


As may be seen from the discussion about Fig. 2c in the previous 
section, the performance of the negative resistance is Just the same 
whether generator e is on the right or the left. And so another general 
property of boosted-transmission lines is that transmission over the 
line is bilateral and symmetrical, as it is for the uniform nonboosted 
line. However, this is true only over a certain range of current through 
the line, i.e., the range within which the booster presents a negative 
resistance, as shown in Fig. 2. This property exhibits a fundamental 


difference between the negative-resistance energy-conversion device 
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and the unilateral-amplifier energy-conversion device. 


5.3 Necessity of using lumps of negative resistance 


If the negative resistance could be added in infinitesimal bits of 
ohms, the line resistance would be neutralized continuously and an 
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ideal line with propagation constant jwVLC and image impedance 
VL/C would result. But, for the present at least, negative resistance 
is available only in sizable finite lumps and so this is what must be 
used. There are several consequences. First, adding lumps of negative 
resistance does not yield the same propagation constants and image 
impedance that adding the same total amount of negative resistance 
continuously does. Formulas are given in Section 5.5 below. Second, 
the added lumps cause reflections, the effects of which are described 
in Section 5.4. Third, the consideration of even very small lumps of 
pure negative resistance is nonphysical, since this implies an infinite 
energy source to supply the infinite band of frequencies. Of course, 
parasitic reactances in real devices prevent this conceptual difficulty 
from arising in practice. But there is a further restriction. In Section 
5.9, it may be seen that, because of transmission-line properties, the 
negative resistance must be reduced as frequency increases to avoid 
instability. 


5.4 Reflection effects and cutoff frequency 


The addition of these lumps of negative resistance to the line 
introduces reflections, because each negative resistance presents a 
discontinuity to traveling waves and so each is a source of reflections. 
But, as shown by G. A. Campbell in the case of added series induct- 
ance,’ if the added series impedances are spaced uniformly, the sum of 
all the reflections is zero up to a certain cutoff frequency. This cutoff 
occurs at the frequency for which the spacing of the added impedances 
is one-half wavelength of the boosted line. Thus, an infinitely long 
boosted line behaves like a smooth line up to the cutoff frequency. 
Another consequence of the process is that the cutoff frequency is 
related in an inverse way to the spacing of the added impedances, as 
shown in Fig. 4. More detailed relations are given in Ref. 22. 

Another general property, then, is that a boosted line has a cutoff 
frequency because of the reflections introduced by the boosters and 
that this cutoff is related to the spacing between boosters in a more or 
less inverse way, and that up to the cutoff, all the reflections cancel 
so that the line looks like a smooth line for these frequencies. 


5.5 Propagation formulas for NIB lines 


Adding circuit elements in finite lumps to an otherwise uniform 
transmission line results in a line with different properties than one to 
which the same total amount of elements has been added continuously. 
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Fig. 4—Relationship between bandwidth and spacing in NIB lines. 
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Fig. 5—Two forms of NIB line sections with equivalent circuits and image im- 
pedances. Zz is the total impedance of a pair of NIB units. 
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The formulas for propagation constant and characteristic impedance of 
a uniform line cannot be modified by replacing the old element value 
with a new average value. The method used to derive formulas for the 
boosted line was first suggested by Campbell,’ in his calculations on 
the addition of loading coils. Many of the formulas appear in un- 
published notes written around 1940 by Bullington™ and around 1950 
by Van Wynen,”* both of Bell Laboratories. To obtain these formulas, 
the appropriate line sections are combined with the negative-impedance 
unit Zz in either of two ways shown in Fig. 5 and an equivalent 7’ or 
a network derived. These complete NIB sections, in either of the two 
forms, may then be cascaded to form a NIB line of the desired length. 
The midsection and midbooster image impedances of the boosted 
line are 


msZin = ZoeNZa/Zp 


and 


mbZr => 4 VZ4Zp, 


where 
ZA = 22. se + ZL 
ZB = 22 oc Si ZL, 


and Z,- and Z,, are the short- and open-circuit impedances of one-half 
a line section and Z, is the booster impedance, as indicated on Tig. 5. 
The propagation factor per section of the boosted line, P, is given by 


sinh (P/2) = VZ4/(Zp = Za). 


Another method of analysis, used in the work of A. L. Hopper,” is to 
represent each of the three cascaded parts of the NIB section by a 
matrix. The A form transmission matrix for the half-section of line is 


cosh p Z, sinh p 
M line = { sinh p 
Zo 


? 


cosh p 


where p is the propagation factor for the half-section of line and Z, is 
its characteristic impedance. The matrix for the series booster is 


ee 6 tees: 
M booster = & ‘ 


By multiplication of the three appropriate matrices, a new matrix 
for the full NIB section of Fig. 5a is found. The elements of this new A 
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matrix are 


Ay = cosh 2p + ee sinh 2p = cosh P 

Ay = Z, sinh 2p + Zz, cosh’p = Zz sinh P 
_sinh2p , Zi.,, _ sinhP 

Aun = a + GF sinh’ Ss 

Arg = Aut. 


Here, 2p is the propagation factor for the full section of line only. 
In the final column above, the matrix elements are expressed in terms 
of the parameters of the full boosted section, P being the propagation 
factor and Zy the image impedance. These may be calculated from 
the matrix coefficients. By multiplying in sequence n matrices of the 
last form, the equivalent transmission matrix of a cascade of n NIB 
sections may be obtained and its properties determined. 


5.6 Image impedance 


Another general property of boosted lines is that the image imped- 
ance is fairly close to being resistive and fairly close to being flat with 
frequency. This property is described best by the curves of Fig. 6 
computed for the particular realization of negative impedance devised 
by L. A. Meacham and described in more detail below in Section VII. 
The parameter Ryer or Ar is introduced here. It is the sum of line 
resistance plus booster resistance, at very low frequencies, per section 
or per mile or for a whole line, as designated. The figure shows the 
image impedance Zy for a particular NIB line configuration along 
with the Z, of the nonboosted line. While Zz is smaller than Z, over 
much of the frequency range, it is not as small as VL/C, which it would 
be if the negative resistance had been added continuously. However, 
its property of being fairly close to resistive and flat with frequency is 
a great improvement over the Z, of nonboosted lines. Its smaller 
magnitude reduces crosstalk considerably. 

As will be seen in Section 5.10, the image impedance at very low 
frequencies may not be just as shown in Fig. 6. The frequency scale 
would have to be considerably expanded to show this. 


5.7 Transmission properiies 
5.7.1 Infinite line 


The next general property (related to the previous one, of course) 
is that the transmission characteristics of the boosted line are much 
better than those of the nonboosted line. As mentioned above, the shape 
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Fig. 6—Image impedance vs. frequency for a 26-gauge NIB line with 6000-foot 
booster spacing. Parameter Ar is the net resistance at very low frequencies per section 
of NIB line. The characteristic impedance Z, of the line before the insertion of NIB 
units is also shown. 


of these characteristics may be chosen by adjustment of the booster 
parameters. When the choice is for as flat an attenuation curve as 
possible, the two parts of the propagation constant are plotted in 
Figs. 7, 8, and 9 for the same particular line configuration used to 
describe Zz, along with propagation of the nonboosted line. 
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Fig. 7—Attenuation vs frequency for a NIB line and the uniform nonboosted line. 


To further highlight the transmission properties of boosted lines, a 
comparison with coil-loaded lines is made. The boosting process 
is a much more powerful one because it adds energy at signal fre- 
quencies. In a coil-loaded line, the reduction in attenuation is directly 
related to the amount of inductance added. Hence, reduction in loss 
is related to cutoff frequency and spacing because of the relation 
between these two and inductance. In the boosted line, the attenuation 
reduction can be varied by means of the parameter Ar, independently 
of the booster spacing and cutoff frequency. The relation between 
these two latter parameters is shown in Fig. 4. 
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Fig. 8—Deviation of phase from linearity for a NIB line (26-gauge wire and 6000- 
foot booster spacing) and the uniform line. 


The comparatively sharp cutoff of the coil-loaded line is accompanied 
by large phase distortion, roughly 10 or more times that of the boosted 
line. The curves of Fig. 9 show that the phase velocity for coil-loaded 
lines is very low while that of NIB lines is high. However, the introduc- 
tion of coil loading has been very beneficial in improving and extending 
the telephone plant. 


5.7.2 Finite lines 


All the discussion so far has been in terms of the properties of the 
infinite line. These can be approached in the finite line if the termina- 
tions are close to the image impedance, otherwise there are differences. 
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Fig. 9—Phase velocity vs frequency for a NIB line and for several coil-loaded lines. 


We saw above that all the reflections from the NIB units cancel in the 
infinite line. But when the terminations of a finite line differ from the 
image impedance, all these reflections no longer cancel and the inpul 
impedance of such a line is not smooth like the Zz curves. There will 
be one ripple for each boosting point and the magnitude of the ripples 
will be directly related to the magnitude of the difference between the 
actual termination and the image impedance. The same applies to the 
insertion-loss curves. 


1864 THE BELL SYSTEM TECHNICAL JOURNAL, NOVEMBER 1974 





Ton = Tes = 34.6 Rs = 6070 
Rnet = 21 OHM / SECTION 


8 80 
i?) 
Ww 
— uJ 
ac 
oO 
ao 
= 6 40 = 
= < 
z 
z #6 ° 
B q 
S > 
+4 Oo w 
2 a 
o wi 
kK WY) 
cc LOSS = 
7p) a 
< —— 
2 -40 
Q 
0 1 2 3 4 5 6 


FREQUENCY IN kHz 


Fig. 10—Insertion loss (in dB and deviation of phase angle from linearity) of a 
five-section NIB line terminated with resistances. 


An example of this effect and of the performance of a NIB line of 
finite length having resistance terminations, which are simple approxi- 
mations to the image impedance but not the best, is given in Fig. 10. 
It should be noticed that the spacing of NIB units and, hence, the 
frequency band for this line are not the same as in Figs. 6, 7, 8, and 9. 
Where it is desired to have the return loss as high as possible, a better 
approximation to the image impedance should be made. See Section 
6.2 for a possible method. 

The ripples in the transmission curves are considerably larger when 
the line ends with half-NIB units rather than half-line sections as in 
Fig. 10. 


5.8 Limit of gain 


Another general property of boosted lines, in common with all 
circuits to which negative resistance or gain is added, is the possibility 
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of self-oscillation, or instability. While, under certain conditions of 
termination, a small amount of insertion gain may be introduced by a 
boosted line, in most applications, the boosters should go no further 
than to cancel, or nearly cancel, the line losses in order to maintain 
stability, as suggested in Section IV. How closely this limit can be 
approached depends largely on the line terminations and on the margin 
against instability in the infinite line used in the choice of booster 
parameters. A similar limitation exists for a uniform line equipped with 
unilateral amplifiers when provision is made for two-way transmission. 
A net loss of zero can be approached as all reflections from imperfect 
terminations approach zero. Of course, a one-way transmission line 
can, in principle, have as high a gain as desired if terminations are 
perfect and there are no return paths for the signal. In some ways the 
NIB line is like the transmission portion of a four-wire repeatered line 
in which the overall net loss is near zero. The complex subject of 
stability of NIB lines will be dealt with in more detail in Section 5.10, 
below. 


5.9 Restriction on the shape of negative resistance vs frequency 


There is a further restriction on finite lumps of negative resistance 
used as boosters at finite spacings in a transmission line. If the total 
amount of negative resistance added to a line approximately cancels 
the series resistance of the line wires at low frequencies, then the 
magnitudes of the negative resistances must be reduced at higher 
frequencies to avoid self-oscillation. This is a consequence of the wide 
range of impedances that a finite length of transmission line can 
present for various frequencies and terminations. How the restriction 
arises will be seen in the next part. If the reduction of negative resist- 
ance at higher frequencies is made with careful consideration, the 
transmission characteristics can be shaped in a number of desirable 
ways, and stability is achieved. This will be discussed below in the 
sections on particular realization. Incidently, another consequence of 
the wide range of line impedances is that a series negative resistance, 
often referred to as open-circuit stable, can cause instability in an 
open-circuited transmission line. 


5.10 Stability in NIB lines 


When lumped negative impedances are inserted into a transmission 
line periodically, self-oscillation can occur; that is, the line can be 
unstable. While it is the presence of the negative impedances that 
brings about the possibility of instability, it is pointless to try to talk 
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about the stability of the negative impedances. What needs to be 
investigated is the whole circuit since, with the same negative im- 
pedance unit, stability will exist for some values of connected passive 
elements and instability for others. 


5.10.1 Infinite line 


First consider the infinite line with uniformly spaced NIB units 
since this is simpler to deal with than the finite terminated line and 
also is the initial condition from which the latter is derived. The 
following material is presented as reasonable description rather than 
precise, rigorous analysis. 

The boosted line is characterized by a propagation constant and 
image impedances. The two principal image impedances, as with 
coil-loaded lines, are the midsection one, msZy, and the midbooster 
one, mbZy, as indicated in Fig. 11. Calculation and measurement of 
boosted lines show that when the negative resistances nearly cancel 
the line resistance, msZz becomes very large and mbZy becomes very 
small near the frequency for which the spacing between NIB units 
is one-half wavelength. This is illustrated in Fig. 12. The line impedance 
will have values intermediate between these extremes at other points 
of the section. Also, all sections will be alike in this because of the 
assumed uniformity. Since the midbooster image impedance can 
become very small, it appears to be a likely parameter to consider in 
investigating the stability of the line. To carry this out, imagine that, 
as indicated in Fig. 11, a booster has been divided into two equal parts 
and separated so that a generator e can be inserted into the line there. 
The current that flows at this point will be 


t= e/(QmbZyz). 


This would be the same at any one of the boosters because of the 





| mbZy 


Fig. 11—Representation of NIB line for studying stability. S is space between 
NIB units of impedance Z, as in Fig. 5. 
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Fig. 12—The two image impedances Zy of a NIB line. 


uniformity. As shown in Section 5.5 above, the midbooster image 
impedance may be calculated from the formula 


mbZn = vZaZzp, 


where 
ZA =~ 22 sc + ZL 
ZB = 2Zoe + Z1, 


and where Z,, and Z,, are the impedances of one-half section of line 
that has been short-circuited then open-circuited at the far end, 
and Z, is the impedance of the NIB unit. Thus, Z4 is the impedance 
seen looking in at the test point of Fig. 11 when the two adjacent 
midsection points are shorted and Zz is the impedance seen when they 
are opened. 
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The irrational nature of Zy makes it difficult to analyze in the usual 
way. However, if Zz becomes zero under certain circumstances, we 
can be fairly sure that instability will accompany those circumstances. 
First consider separately Z4 and Zz as plotted in Figs. 13 and 14 
from calculations of a particular NIB line. In each of these is a separate 
enlarged plot of the region near Z4 = 0 and Zz = 0. Only in these 
enlarged plots do the differences caused by giving Ar the three values, 
—1.0, 0.0, and +1.0, show up. But these differences are very significant 
for the question of stability. Extensions of these plots for negative 
frequencies would yield mirror images of the curves actually plotted 
reflected about the resistance axis. In the case of Za, it is seen that the 
curve extends into quadrants II and III to the left of the origin, that 
is, into the region of negative resistance, at very low frequencies when 
Ar <0, and otherwise stays in the region of positive resistance. The 
curve of Zs extends into quadrants II and III to the left of the origin 
for frequencies near 17.6 kHz when Ar < 0. 
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Fig. 13—Reactance vs resistance plot of impedance Za as frequency varies. Zz, 
is impedance of a pair of NIB units and Z,, is the open circuit impedance of a half 
section of uniform line. 
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Fig. 14—Reactance vs resistance plot of impedance Zz in which Z,, is the open- 
circuit impedance of a half section of uniform line. 


Thus, when Ar < 0, the product Z4Zp can be zero at very low 
frequencies by means of Z,4 and at a considerably higher frequency by 
means of Zz. The latter is responsible for the zero at 17.6 kHz in 
mbZy, shown in Fig. 12, and arises through a resonance of the reactance 
of the booster and that of the open-circuit section of line. It is im- 
possible to show the effects of the zero in Z4 on Z4Zz in Fig. 12 because 
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Fig. 15—Reactance vs resistance plots of Zz and Zo. 


of the very low frequencies (see Fig. 13) at which they occur. Plots of 
the impedance Zy = $VZ4Zz similar to those of Figs. 13 and 14 
touch the origin at a very low frequency or loop around it at the zero 
in Zg when Ar <0. A lumped network, whose impedance may be 
expressed as a rational function of complex frequency s, would have a 
similar plot looping around or touching the origin if one of its roots 
occurred at a complex frequency with positive real part. In these 
circumstances the network is unstable. From this we may infer that 
the NIB line can be unstable in the two ways shown when Ar < 0. 
The way involving Zs depends on other circumstances as well. The 
booster parameters for Figs. 13 and 14 were chosen with respect to the 
line parameters so that a just-stable condition exists for Ar = 0. If 
these parameters were different, e.g., for a larger time constant in the 
booster, Ar could be negative by a limited amount without instability 
at the higher frequency. There is no qualification about the zero 
through Z,4. This always arises in the infinite line for Ar < 0. 

Now we may see why the lumps of negative resistance must be 
reduced with increasing frequency to avoid instability. Consider 
the X vs R diagram of Fig. 15 in which Zs, and 2Z,, are plotted. 
If the booster impedance Z, is a pure resistance, — 1, which cancels 
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all the line resistance, the diagram for Zs, would be that at the left 
which loops around the origin and would be unstable as we have 
seen. But if the negative resistance component of Zz, is reduced in 
magnitude as frequency increases, we may obtain the Zz curve shown 
in the middle, which is just stable. Rg must be positive where Xz = 0. 
Reducing |Rz| for this purpose may be done in such a way as to also 
shape the transmission properties of the NIB section in some desirable 
way, as indicated above. 

These conditions may be summarized by the diagrams of Fig. 16, 
which are sketches made from calculated values of Z4Zz but are not 
plots of actual data. In the top diagram, two conditions, Ar < 0 and 
Ar > 0, are shown, but the booster capacitance is sufficiently large that 
a zero in Zz does not occur. In the bottom left diagram, the same two 
conditions are shown, but here the booster capacitance is sufficiently 
small that the zero in Zg occurs even when Ar > 0. In the bottom right 
diagram, no zero occurs even though Rg may be negative at some 
frequencies. These diagrams may be interpreted as Nyquist diagrams 
for testing the stability of impedances,?*> as the encirclements of the 
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Fig. 16—Nyquist plots of 2. 
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origin in the various cases agree or not with the rules for stability, as 
noted. Only the positive frequency parts of the plots are shown in the 
diagrams for simplicity. 

It can be realized from Figs. 13 and 14 that whether or not a zero 
occurs in Zz depends on the details of the booster involved and cannot 
be determined in general circumstances. 

While the above discussion of stability in the infinite NIB line is not 
a rigorous mathematical one, it is the result of much study, calculation, 
and experiment and is believed to be a good description of the situation. 
These conclusions are also in agreement with a computer investi- 
gation of the nature of the roots of the input impedance of a NIB 
section in which the line was approximated by a rational network and 
the section terminated in various ways. 


5.10.2 Finite line 


The stability of a NIB line consisting of a finite number of sections 
is considerably more complicated than the stability of the infinite 
line because the terminations (if different from Zz) must be considered 
also. Because details of the configuration of line and booster, as well 
as the terminations, are important in these problems, they are more 
appropriately discussed in connection with particular arrangements. 
Two particular situations will be described briefly here. 

The extreme condition for line stability with any passive termina- 
tions whatsoever requires that 


(Ri, — Riz) > 0 for all frequencies. 


In this, Ri: and Ry, are the real parts of Z11 and Zi, the impedance 
parameters of the matrix for the cascade of NIB sections under 
consideration. This condition, first derived by Gewertz,”° has also 
been derived by Llewellyn?’ and others. A simple neat derivation is 
given by Walter H. Ku.” To satisfy this condition, the net resistance, 
Ar, of line and booster must be fairly positive. Thus, what has some- 
times been a goal in NIB work, namely, a line with zero net loss and 
stable for all terminations, is an impossibility. Of course, we cannot 
expect a line equipped with 22-type repeaters to be stable at zero net 
loss for all terminations either. 

On the other hand, when resistance terminations near the low- 
frequency magnitude of Zy are used, the line can be stable even when 
the net resistance, Ar, is negative. This is illustrated in Fig. 17, which 
shows some stability relations for a four-section NIB line. A number 
of other configurations and numbers of sections have been studied, but 
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Fig. 17—Boundary curves between stability and potential instability in finite 
NIB lines terminated with resistance. 


only this one is presented here. In this figure, the ordinate is the net 
resistance Ar per section of line, as already discussed, and the abscissa 
is the resistance R (normalized) which terminates the line. The two 
parts of the solid curve A would meet if more negative values of Ar 
had been plotted. For values of Ar and FR that lie above this curve, the 
input resistance of the line is positive at all frequencies; below it the 
input resistance is negative at some frequency. This curve is, thus, a 
boundary between stability above and potential instability below. 
Instability will exist in fact if, in addition, the input reactance is zero, 
or if it is cancelled by external reactance at the frequency for which 
the input resistance is negative. The dashed line at the top of the figure 
indicates the value of Ar necessary for the line to be stable for any 
termination. The worst termination is a reactance. If some resistance 
k& is added to this worst reactance, the dashed boundary curve B is 
obtained, separating stability (above) from potential instability 
(below). Above the dashed line near the middle of the figure, the NIB 
line is stable for any resistance termination. 

While the curves of Fig. 17 were obtained by means of a rather 
elaborate computing process, they were verified on a laboratory setup. 
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Additional verification was obtained from a different computing 
process. The two-line sections of a single-section NIB line were ap- 
proximated by lumped-element networks. Then the roots of the 
characteristic equation of the whole NIB line section, including 
termination, were found. The conditions of instability or stability, as 
determined from the nature of these roots, was in agreement with those 
corresponding to Fig. 17. 

It is seen from this figure that, for a small range of terminating 
resistance, stability prevails even for negative values of the net 
resistance Ar. However, as the number of sections in the line increases, 
the amount of negative Ar that can be tolerated becomes smaller, 
approaching none for the infinite line, as already determined. Actually, 
for a line of 16 sections, Ar cannot be less than zero. 

The boundary curve A of Fig. 17 is one section of a boundary 
surface over the half-plane of a general terminating impedance. 

It should be noted that the Ar = 0 dividing line discussed here is a 
result of other NIB parameters having been chosen so that line attenua- 
tion is zero and as flat as possible over the appropriate frequency band. 
If these other NIB parameters had been chosen in other ways, the 
dividing line could have been at some positive or negative value of Ar. 

The NIB configuration in which the line ends with half-NIB units 
instead of half-line sections requires a considerably higher value of 
Ar for the same degree of stability. 


5.11 Determining the booster impedance Z, 


Two methods that have been used in determining the required 
booster impedance Z, are described briefly. 

The first method was originally described by K. Bullington,” and is 
a calculation made to determine the booster impedance Z, that will 
give some desired propagation factor P. This may be done using the 
Ax matrix coefficient for one NIB section given in Section 5.5, above. 
If the two forms of this coefficient are equated and solved for Zz, 


we get 
Zz. = 2(Z,/sinh 2p) (cosh P — cosh 2p), 


where 2p and P are the propagation factors for one section of plain line 
and boosted line, respectively. 

Then a network which approximates the computed impedance Zz 
as closely as possible is devised. This may be done formally, by using 
a “negative-impedance converter,” as indicated by John Linvill!’ and 
others, to convert a positive impedance, — Zz, into the negative im- 
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pedance, Z,. Or it may happen that a large part of Zz is supplied by 
some special device and only an additional ‘‘trimming”’ network is 
needed. Small differences between the real network and the calculated 
Zz, can result in stability problems. These should be investigated with 
the real network. 

The second method is to take a particular network configuration 
having a negative resistance and vary its parameters and/or add some 
trimming elements to get desired transmission properties and stability 
for the resulting NIB line. 

The calculated performance curves shown in Figs. 6, 7, 8, and 9 
were obtained from NIB sections derived by the second method using 
the booster circuit (Fig. 23) devised by L. A. Meacham.!* Two of these 
curves are replotted in Fig. 18. 

To compare with these, one example using the first method is given 
also. For this, the third-degree maximally flat characteristic was taken 
as the desired transmission curve. If this is expressed in complex 
frequency form, i.e., 

Y =1+ 25 + 29? + 53, 
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Fig. 18—Attenuation and phase curves resulting from two forms of NIB units in 
NIB lines. 
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- Fig. 19—Reactance vs resistance plots of the two forms of NIB units used in 
ig. 18. 


then compatible amplitude and phase curves are realized. The transmis- 
sion curves for the NIB section designed in this way are shown in Fig. 
18, designated maximally flat. 

Curves, Xz, vs Rx, describing the negative impedances Zz, derived 
in the two ways, are plotted in Fig. 19. 

A good computer program is essential to the successful use of either 
method. How these two methods have been used in actual work will 
be mentioned in the section on history, which follows. 


5.12 Pulse transmission 


In the transmission curves of Fig. 18 for the Meacham unit, the 
booster parameters were chosen so that the attenuation curve of the 
NIB line would be as flat as possible. The parameters can also be 
chosen so that the phase curve of the line is as linear as possible. This 
makes for a minimum of distortion in pulse transmission as indicated 
in oscillograms shown in Ref. 18. Photographs of pulse waveforms 
received through NIB lines formed in both of these ways are shown 
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in Fig. 24. NIB parameters were chosen for flat attenuation for 
Fig. 24a and b and for the linear phase in Fig. 24c and d. 


5.13 Temperature compensation 


In most transmission lines, it is desirable that compensation be 
made for the variation in attenuation caused by temperature varia- 
tions. Several methods have been tried. When the NIB unit is the 
Meacham one, the compensation may be accomplished quite readily 
in many situations, as mentioned in Ref. 18, by using a tempera- 
ture-sensitive resistor as a part of one of the NIB resistors. This 
method is particularly simple and effective for underground cable 
when the temperature-sensitive resistor can be underground also at 
essentially the same temperature as the cable. 


5.14 Interference susceptibility 


One problem which may arise with the use of NIB lines, especially 
when series units are used, is that of interference currents. Even if the 
line is well balanced so that longitudinal-induced interfering currents 
do not appear in the metallic path, they must pass through the negative- 
impedance units. If these currents are large compared with the signal 
current, they may carry the total current beyond the negative-im- 
pedance region of the unit and so cause distortion in the signal (see 
Tig. 2). To avoid this, a much larger current capacity in the NIB 
unit than would be necessary for the signal alone must be provided, 
or operation in a region of high interference must be avoided. 


Vi. HISTORY OF NEGATIVE-IMPEDANCE BOOSTING 
6.1 Crisson 


It was mentioned above that the possibility of using negative- 
resistance boosters to reduce the attenuation of transmission lines 
was recognized before 1919.‘ But the first theoretical and experimental 
consideration of the use of such boosters in telephone lines was de- 
scribed by Crisson in his 1931 paper in The Bell System Technical 
Journal.!* However, it is believed that this consideration was of the 
occasional or single addition of a negative-resistance unit to a line 
rather than of their periodic addition.” 


6.2 Bullington and Edwards 


The first appearance of the concept of introducing lumps of negative 
resistance periodically into a line to obtain a new ‘‘uniform” line with 
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desirable transmission properties seems to be in the work of K. Bulling- 
ton? in 1940 and 1941. This effort was stimulated partly by the avail- 
ability of a quite small thermistor unit which had a negative-resistance 
region in its voltage-current curve in the audio-frequency range. At 
that time, it was recognized that the use of a net negative resistance 
at all frequencies would cause instability. However, it was also thought 
that if each negative-impedance section could meet the Nyquist 
stability requirement for impedances (see Refs. 23 and 25, and Fig. 
19), any desired number of such sections could be connected in tan- 
dem with overall stability. While these point contact units were . 
mechanically fragile, a stable net gain of 2 to 3 dB over the frequency 
range of 200 to over 3000 Hz was demonstrated on a 54,000-foot 
section of 19-gauge cable pair. Five negative-impedance series-type 
units were inserted at 12,000-foot intervals with 3000-foot end sections. 
The application intended was for voice-frequency toll circuits. Con- 
siderable analyses were made™ that are quite complete and very 
readable. The discussion covers several of the general properties of 
Section V, particularly Sections 5.4, 5.5, 5.6, 5.7, and 5.10, that is, 
reflection effects and cutoff, propagation formulas, image impedance, 
and stability. The first method (see Section 5.11) of choosing the 
booster impedance Zz, was used, assuming zero loss and linear phase 
in the transmission band for a start. The thermistor impedance pro- 
vided a negative resistance that decreased with frequency and also a 
positive reactance. Both of these were a major part of the desired Zr. 
The remainder was made up by a simple added network. 

An interesting suggestion, which does not seem to have been at- 
tempted, was made by Bullington for the termination of finite NIB 
lines. His calculations indicated that if the line were ended with 
Q.7-line sections instead of 0.5-line sections, the correct terminating 
impedance would be a very nearly constant resistance plus an induct- 
ance, which would be simpler to construct than the image impedance, 
particularly near the cutoff frequency. This particular project was 
ended by the more pressing demands of World War II. 


6.3 Merrill and the E repeater 


The next stage of effort produced the first E-type negative impedance 
repeaters, in the development of which the work of J. L. Merrill, 
Jr.!92° occupies a large part. Merrill had done some work before the 
war in pursuing the work begun by Crisson. As a part of this effort, 
the analysis of K. G. Van Wynen, referred to above, dealt with the 
impedance and transmission properties of networks equivalent to the 
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boosted sections. While the use of an E repeater introduces a negative 
resistance in series with the line to reduce its loss, the concept of use 
is quite different from that of a line with a considerable number of 
negative-impedance units added periodically along its length. The E 
repeaters are used only in central offices and usually with only one or 
two in a given line. Hence, they disturb the uniformity of the line and 
cause reflections. However, the latest versions of these repeaters have 
been installed at a great rate and are filling a real need in making gain 
available for the trunks of the exchange plant. One of the important 
factors in the simplicity of the use of E repeaters is that the de con- 
tinuity of the circuit is maintained and so de and very-low-frequency 
signaling currents go through without rearrangement or new apparatus. 
Also, if the repeater fails, the unrepeatered line is still available. How 
this comes about may be seen from Fig. 20 which shows a block 
diagram of the E1 series repeater and how it is connected. The negative 
impedance is coupled into both conductors of the line by means of the 
transformer. The negative impedance is generated by a device called 
a negative-impedance converter which, when a passive impedance Z, 
is connected to two of its terminals, presents an impedance of approxi- 
mately —Z, at its other two terminals. The “simplicity” factor 
mentioned above emphasizes the fact that one of the principal differ- 
ences between conventional amplifying and the use of negative resist- 
ance lies in the manner in which the source of energy replenishment 
for the signal is coupled into the line. 

The first model of the E repeaters, the E1, was introduced in 1948.1° 
The converter used a twin triode vacuum tube with positive feedback 
coupling, and the Z, network could be adjusted for various amounts 
of negative impedance, or loss reduction. The repeater can be used in 
coil-loaded lines or nonloaded lines. 








LINE 
a 


Fig. 20—Method of coupling E-type negative-impedance repeater into trans- 
mission line. 


1880 THE BELL SYSTEM TECHNICAL JOURNAL, NOVEMBER 1974 





RESIDUAL 


«ES 0 Loss — —— —~——-—~——~ > LOSS 
ZA 
Z| = V2ZaZp = Zo 
4:) 


Za = NZo L-180° 


Zz 
ZB = e+ 180° 


Fig. 21—Equivalent circuits for E-type negative-impedance repeaters. 


As discussed above, the introduction of these impedance lumps into 
a line causes reflections or echoes when the same line Z, termination 
is retained. Hence, the principal use of El repeaters has been in 
interoffice trunks rather than in toll trunks, where, because of the 
longer delays involved, echoes can be much more objectionable. 
Where there is only one repeater in the line, as in the majority of these 
cases, the amount of gain and the amount of reflection are directly 
proportional to the magnitude of the negative resistance used. 

As mentioned above, if series and shunt negative impedances of 
appropriate magnitude are combined properly, not only will losses be 
reduced, but the characteristic impedance of the line may be matched 
approximately. This was done in the E23 repeater” to obtain much 
smaller reflections than those that accompany the El repeater, and 
so be usable in toll-connecting trunks. This process is indicated in the 
diagrams of Fig. 21 taken from Ref. 20. It shows how the negative- 
impedance repeater cancels a part of the line loss leaving a residual 
propagation factor of az + j82, while also matching the characteristic 
impedance Z, of the line. It also shows that by means of the bridged-T 
structure, only two instead of three negative-impedance units need be 
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built, and how the two are proportioned. The coils provide coupling 
into the line as well as being part of the bridged-T structure. 

A recent version of the E repeaters, the E6, is like the E23 in that 
it matches the line, but it is different in some other respects.”* It was 
introduced about 1960. It uses transistors instead of vacuum tubes in 
the basic converter circuit devised by R. L. Wallace and J. G. Linvill.!” 
In the physical embodiment, the amount of negative resistance, or 
gain, adjustment is separated from the line impedance-matching 
function to simplify the operation of the repeaters. 

The question may be asked, since two amplifiers are used to generate 
the two negative impedances, why not use them to make a 22-type 
repeater? There seems to be no simple direct answer to this question. 
In some situations or at some times, one of the methods may provide 
an advantage over the other and vice versa. l’or example, when a 
simple way of providing continuity for de signaling along with gain 
is needed, there is an advantage in using a negative-impedance re- 
peater. The advantage may lie with the 22-type repeater in other 
circumstances. 

The E repeaters provide a simple way to add small amounts of gain 
to exchange trunks. But in their present conception, they do not 
provide the broad possibilities that can be envisioned with the use of 
periodically added negative impedances to obtain various desirable 
transmission characteristics. 


6.4 Schott and Wallace 


During the period 1953 to 1956, L. O. Schott did some design and 
experimental work on NIB lines. Two particular situations were 
considered. In the first, six negative-impedance units were spaced 
one-third mile apart in 22-gauge cable to give a signal band from about 
7 kHz to 100 kHz for possible carrier applications. The negative- 
impedance unit shown in Fig. 22a was similar in form to the El 
repeater, but with the converter using the transistor circuit of Linvill.!” 

A considerably different negative-impedance-unit circuit, lig. 22b, 
suggested by R. L. Wallace, was used in the second situation. Instead 
of the two like transistors used earlier, one npn, and one pnp were used 
in the new arrangement. With this change, the coupling transformer 
was eliminated and the current for powering the unit allowed to flow 
through the line and units in series. While this requires two units, one 
for each conductor, it brings a great simplification to the circuit with 
the opportunity for miniaturization. These units were applied to six 
3900-foot sections of 32-gauge cable, providing a transmitting band 
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LINE LINE 


(a) “T (b) T 


Fig. 22—Circuit diagrams for two negative-impedance converters: (a) Linvill 
circuit, and (b) Wallace-Schott circuit. 


from 0.1 kHz to 7.5 kHz. The RLC network in series with the unit was 
added to provide a margin against instability. 


6.5 Hoth and Ross 


During the period from about 1954 to 1957, a study in considerable 
detail was made by D. F. Hoth and W. L. Ross concerning the possi- 
bility of using negative-impedance boosting to provide gain in exchange 
trunks. Their main conclusion was that, though their complete experi- 
mental NIB system operated successfully and appeared to offer small 
cost savings, the E repeaters for voice and the Tl system for carrier 
seemed to hold more promise in the exchange trunk area. This con- 
clusion appears to have been influenced, in part at least, by some 
unsatisfactory aspects of the negative-impedance unit used. 

This unit was an avalanche transistor with one resistor and one 
capacitor connected externally. The negative resistance is generated 
by the avalanche discharge process within the transistor and is the 


NEGATIVE-IMPEDANCE UNITS 1883 


series type. Some of the unsatisfactory properties were lack of uni- 
formity of the transistors, nonlinearity of the negative resistance 
developed, and comparatively high voltage required at each unit. 

A real system was set up consisting of three 9000-foot sections of 
26-gauge cable, with provision being made for signaling, powering, 
and matching of the line to office impedance. The line had an image 
impedance of about 300 ohms and a sharp cutoff near 8 kHz. The low 
image impedance is a distinct advantage for reduced crosstalk. Regu- 
lation of net loss was by means of a 40-Hz pilot tone. The change of 
amplitude of this tone in response to cable-temperature change was 
used to shift the de line current, which in turn would change the amount 
of negative resistance because of the nonlinearity of the NIB units. 

The shape of the booster impedance Zz, was chosen by the first 
method described in Section 5.11 and the actual unit made to approxi- 
mate this fairly well over the transmission band. Instead of giving 
each NIB section sufficient loss so that the system would be stable 
with any passive termination, they designed the sections for zero net 
loss and then put terminating networks at each end so that the system 
was stable for any passive termination beyond these networks. This is 
equivalent to restricting the termination of the actual NIB sections 
to lie in a certain impedance range. 

Considerable analysis was done on the problem of stability, partic- 
ularly on the conditions required for stability with any passive termi- 
nation. 


6.6 Other projects and general comments 


While there have been quite a number of projects, beyond these 
listed above, in which the use of negative impedances in transmission 
lines has been studied, the list is representative except for the work 
of L. A. Meacham dealt with separately in Section VII. 

A handicap in many of the projects was the goal of a line with zero 
net loss which at the same time could tolerate terminations of any 
passive impedance whatsoever. We have seen that this is an impossible 
goal. The closer we approach zero net loss, the more restricted the range 
of terminating impedance becomes until, at zero, termination must be 
perfect except in the few special situations indicated in Section 5.10.2. 
But similar restrictions apply to other ways of reducing line loss also. 
It was shown clearly in the early years of repeatered lines*!° that 
irregularities in the line and imperfect terminations severely limited the 
amount of gain that could be used in a line. A paper, “‘Some J'unda- 
mental Properties of Transmission Systems,’’?? was written by IF. B. 
Llewellyn in part to show that the restrictions that are necessary for 
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transmission circuits to be stable are essentially the same whether 
unilateral amplifiers or negative impedances are used to reduce the 
losses. 


Vil. MEACHAM’S WORK 


In 19638, a new look at the use of negative-resistance lumps placed 
uniformly in a line was begun. 


7.1 Meacham NIB unit 


The point of departure here was a fairly simple realization of a 
negative-resistance circuit. The generation of negative resistance was 
not accomplished in a single device, as in some previous situations, 
but by the cross coupling of two transistors in a simple way. Though 
derived independently, it is similar to the Wallace circuit of Fig. 22b 
in that it uses an npn and a pnp transistor. But the Meacham circuit 
does not use coupling capacitors. Two advantages come from this: 
the negative resistance is usable down to de, and the physical bulk of 
the capacitors is eliminated. The circuit could be realized also by a 
single pnpn device. Later in the investigation, the negative resistance 
was made highly linear by the addition of two diodes, and the trans- 
mission shaping possibilities were enlarged by adding a resistance and 
capacitance in shunt.!8 The Meacham negative-impedance circuit and 
its voltage-current characteristic are shown in Fig. 23. These negative 
impedances were used for the transmission curves of Figs. 6, 7, 8, 
and 9. And while these were plotted from calculated data, measured 
performances of the real physical lines are always in very good agree- 
ment with those calculated. Some work was done to demonstrate that 
the NIB units could be built in very small integrated-circuit form. 


7.2 Field tests 


Two field tests were successfully completed and numerous laboratory 
demonstrations made. The lines constructed for these tests provided 
excellent transmission with low phase and nonlinear distortion for a 
variety of signals. Lines in both tests were about 32 miles long. One 
was a single circuit in a suburban exchange area, part aerial cable and 
part underground cable.!® The other test involved three circuits using 
three order wire pairs in the L4 spur cable from Netcong to Newark, 
New Jersey. Extensive measurements of these were made over a 
period of about a year. They were operated at a net loss of about 3 
dB and had very low noise and a satisfactory return loss. The longi- 
tudinal interfering currents were negligible. Signals from a 202D Data 
Set system (frequency-shift keying at 1200 bits/second) with its 
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Fig. 283—Circuit diagram for Meacham negative-impedance unit with equivalent 
circuit and voltage-current curve. 


equalizers removed were sent through one of these lines having a 
bandwidth of about 5 kHz with no noticeable distortion (see Fig. 
24a). With a slight rearrangement at the terminals, baseband rec- 
tangular binary pulse signals at a 12-kHz rate were easily transmitted, 
as may be seen from the eye diagram of Fig. 24b. These lines were 
designed for essentially flat attenuation curves. If the NIB unit 
parameters are chosen to have the most linear phase curve instead, 
pulse transmission is even better. This is shown in Figs. 24c and 24d 
for such a line consisting of nine 6000-foot sections. Figure 24c is the 
response to a single 50-us rectangular pulse and Fig. 24d is the response 
to a pseudo-random-pulse train at 14 Kbauds and 16 levels. 

Suitable means were worked out for precise measuring of the NIB 
units and for measuring the overall net resistance of the operating line. 


7.3 Papers published 


Two papers have been published on parts of the work, one by L. A. 
Meacham!’ and the other by A. L. Hopper.” The purposes of the 
present paper are: (7) to present a comprehensive general view of the 
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Fig. 24—-Pulse transmission in NIB lines. Parameters chosen for flat attenuation 
in (a) and (b) and for linear phase in (c) and (d). 


properties of NIB lines (Section V), (zz) to describe the earlier work 
on negative-impedance boosting, and (277) to present additional 
results not given in the two previous papers, notably the work on 
stability outlined in Section 5.10. The author of the present paper 
began work with this project late in 1968. 

The paper by Meacham is a general one describing the negative- 
impedance unit, how it is used in the line, and the improved trans- 
mission which results in various situations. The one by Hopper is on 
the design of NIB lines following the viewpoint mentioned above, 
namely, varying the parameters of the NIB unit to achieve desired 
transmission properties in a NIB line made up of these units added 
periodically to a particular uniform line. 

In all the work on this project, the approach was not to try to build 
a NIB unit to have a certain impedance Z_z, but to vary the parameters 
of this particular NIB circuit configuration to obtain certain desirable 
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transmission properties of the combination of transmission line plus 
booster. This provides advantages as well as restrictions. 


7.4 Design of NIB lines 


The use of the high-speed digital computer was essential in carrying 
out this process. Details of this method and design results for a number 
of uniform lines and various bandwidths are given in Hopper’s paper. 
The curves of Fig. 4 on the relation between spacing and bandwidth 
are one result of this work. These designs are for zero net loss and are 
just stable. Some backing away from this condition is necessary for 
practical operation. The transmission objective in most of these 
examples is as flat an attenuation curve as possible. However, it is 
possible to make a somewhat different choice of parameters to get as 
nearly linear phase curve as possible. Some work, not published, was 
also done in choosing parameters so that some measure of distortion in 
pulse transmission would be a minimum. In this work of designing 
NIB lines, the two NIB units in the two wires of one section of line 
are represented by the equivalent circuit shown in Fig. 23. 


7.5 Choosing element values for the NIB unit 


The designs for the lines are expressed in terms of the parameters 
of the equivalent circuit. How these are translated into the element 
values of the real physical circuit (see Fig. 23) is described in un- 
published notes by D. M. Chapin and is outlined below. 

An important parameter of the Meacham NIB unit is the voltage 
Vx, which is the difference between the emitter-base voltage for a 
transistor and the voltage drop across its associated diode. The diodes 
were added to counteract the nonlinearity of the transistor emitter- 
base junction. Most of the units were built with silicon transistors and 
germanium diodes that provide a value of about 0.5 V for Vx. For 
smaller values of Vx, the current range over which the negative 
resistance appears is less, and disappears for Vx = 0. The circuit will 
operate as a negative resistance without the diodes, but will not be as 
linear. 

The first element to be chosen is the resistor Re. There are three 
ways this may be chosen, but the most direct is the following: 


fey | Ve 


(64 
Ha oto (Qa —1)D ’ 


where a@ is the a of the transistor, Vx the parameter mentioned above, 
Rk, is the positive resistance in the equivalent circuit, and D is the 
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current range of the negative resistance 7, — J» in the V-I curve of 
Fig. 28. The parameter R, is 
R, 4Rik, 


~ Ry Ret 


and, as mentioned above, applies to the two units, whereas the element 
values are being derived for a single unit. Irom studies of the NIB 
line process, a very good value for R, which is used in most designs is 


R, = 100 ohms. 


The above choice for R2 is nearly the same as that which would have 
been obtained to minimize the total de voltage drop across units and 
line per section. Or it could have been done by 


Re = Vx/Ie, 


where J; is the value of current at which the negative resistance begins 
in the V-I curve. 
Then, R; is determined by 


_ Rk, 
aa a R; 
It may be shown that 
Rn R (2a — 1)R.—- Ri 
a Ri+ Re 
and 
C3 Rit Re 


Cr = 





2 [ (2a =s 1)Re an Rey 


If Ry is the total (both conductors) resistance of the nonboosted 
line per section, we have Ar = R, — R, + Ry with which to determine 
the required Rf,, where Ar is the net low-frequency resistance per 
section as used in Section 5.10. From these we have for R; 


R= (he + Rr 4 Ar) (Ri + fa) | 
os 2L (2a — 1)R2 — Ri] 





If the circuit is operated without the diodes, Vx is replaced by V2. 

The choice of R, and C, is discussed in Hopper’s paper, Ref. 22. 
This completes the translation of the equivalent-circuit parameters 
into the elements of the real NIB circuit. 

To keep the net resistance Ar close to its desired value, it is necessary 
that negative resistance (R, — R,)/2 be adjusted quite precisely. 
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The simplest way to do this is to build the units with elements of 
reasonable tolerances except for R3. Then, with a standard variable 
resistor connected in place of R3, this variable resistor is adjusted 
until the measured value of (R, — R,)/2 reaches its desired value. 
The variable resistor is then replaced by a fixed one of this value. 

The measurement is simply made by superimposing a small low- 
frequency current on the bias current, which is (J, + J,)/2. A standard 
variable positive resistance in series with the NIB unit is then adjusted 
for a null in voltage across the combination which indicates equality 
of positive and negative resistance. 


7.6 Temperature compensation 


Compensation for temperature variations in the lines of the two 
field tests was made by replacing part of the fixed resistor R3 in Fig. 23 
by a resistor wound with a nickel alloy having a suitable positive 
temperature coefficient. Thus, the negative resistance of the NIB 
unit, R,, increases as the cable resistance Rx increases. This method is 
particularly effective and simple when the temperature-sensitive 
resistor can be in the same temperature environment as the line. 


Vill. CONCLUSIONS 


Knowledge about the properties of NIB transmission lines, which are 
lines into which negative-impedance units have been inserted periodi- 
cally, has been advanced considerably by this latest investigation. The 
distinctive properties of these lines stem largely from the way the 
sources of energy replacement, that is, the negative-impedance units, 
are coupled into the line. The cost and bulk of coil or resistance hybrids, 
necessary when unilateral amplifiers are used, are avoided and bilateral 
transmission is provided directly. The reflections caused by the 
insertion of these units into the line cancel each other up to a certain 
cutoff frequency when the units are uniformly spaced and the line is 
terminated approximately in its image impedance. There is an approxi- 
mately inverse relationship between this cutoff frequency and the 
spacing of the units. 

Two separate field test lines, each about 32 miles long, were con- 
structed and operated for some time. One of these was in a suburban 
exchange area and used partly aerial and partly underground cable. 
The other used order wires in the L4 spur cable (underground) between 
Netcong and Newark, New Jersey. Both of these demonstrated, in 
agreement with analysis, that these NIB lines can be operated stably, 
with a margin, at a net loss of about 2 dB and that they can provide 
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high-quality transmission with low-phase and nonlinear distortion for 
speech or pulse signals. Because the NIB units are effective down to 
de, baseband transmission of pulse signals is possible and was demon- 
strated on the test lines. 

Two problems have held back the use of these lines. The first has 
to do with the effect of temperature variations of the transmission 
lines. A simple method for the correction of these variations which 
has been effective in the test lines is not universally applicable. It 
requires the temperature of the booster to be simply related to that of 
the linc. The second is the susceptibility of the line to certain kinds of 
interfering currents and is more fundamental, as no satisfactory way 
to overcome it has been found. In well-balanced conventional trans- 
mission lines, the effect of induced longitudinal interfering currents 
is negligible. In NIB lines, these currents must flow through the NIB 
units and if the interfering currents are large compared with the signal 
currents, they may cause the total dynamic current to go outside the 
region of negative resistance and, hence, cause large distortion. This 
effect limits the use of NIB lines to environments where interfering 
currents are small. The interfering currents were negligible in the two 
test lines, and there are probably many such situations. Nevertheless, 
it is apparent that the use of the negative-impedance method to reduce 
attenuation is worthy of further consideration in certain situations, 
as for example in those which are similar to the Netcong-Newark field 
test, even though because of the above difficulties the method may not 
be suitable for use in all situations. 
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